Big Data Concept + Tools + Techniques Training
Big Data Concept + Tools + Techniques Training
Big Data Concept + Tools + Techniques E-Learning Training Online Gecertificeerde docenten Quizzen Assessments test examen Live Labs Tips trucs Certificaat.
Lees meer- Kortingen:
-
- Koop 2 voor €390,04 per stuk en bespaar 2%
- Koop 3 voor €386,06 per stuk en bespaar 3%
- Koop 4 voor €382,08 per stuk en bespaar 4%
- Koop 5 voor €378,10 per stuk en bespaar 5%
- Koop 10 voor €358,20 per stuk en bespaar 10%
- Koop 25 voor €338,30 per stuk en bespaar 15%
- Koop 50 voor €318,40 per stuk en bespaar 20%
- Beschikbaarheid:
- Op voorraad
- Levertijd:
- Voor 17:00 uur besteld! Start vandaag. Gratis Verzending.
- Award Winning E-learning
- De laagste prijs garantie
- Persoonlijke service van ons deskundige team
- Betaal veilig online of op factuur
- Bestel en start binnen 24 uur
Big Data Concept + Tools + Techniques E-Learning Training
In de moderne wereld worden gegevens in een exponentieel tempo gegenereerd. Het genereren van zakelijke gegevens neemt in een even snel tempo toe. Slechts een klein percentage van de bedrijfsgegevens zijn gestructureerde gegevens in rijen en kolommen van databases. Deze dataproliferatie vereist een heroverweging van traditionele technieken voor het vastleggen, opslaan en verwerken. Big data is een term die datasets beschrijft die zo groot zijn dat ze niet kunnen worden beheerd met traditionele databasesystemen. Big Data is ook een verzameling tools en technieken om deze problemen op te lossen.
Learning Kits zijn gestructureerde leertrajecten, voornamelijk op het gebied van Emerging Tech. Een leerpakket houdt de student werkt aan een algemeen doel, hen te helpen uw loopbaanambities te verwezenlijken. Elk deel leidt de student stap voor stap door een diverse reeks onderwerpen. Leerpakketten zijn:bestaande uit verplichte tracks, die alle beschikbare leermiddelen bevatten, zoals assessments (eindexamens), mentor, oefenlabs en van cursus e-learning. En alle bronnen met 365 dagen toegang vanaf de eerste activering.
Deze Learning Kit, met meer dan 25 uur online content, is onderverdeeld in de volgende tracks:
Cursusinhoud
Big Data Infrastructures
In this learning, the focus will be on big data concepts, non-relational data, and big data analytics.
Courses (7 hours +)
The Big Data Technology Wave
Big Data in Perspective
Course: 17 Minutes
- Course Introduction
- Introducing Big Data
- The Biggest Wave Yet
- Emerging Technologies
Global Data
Course:14 Minutes
- Defining Big Data
- Key Terms for Data
- Sizing Big Data
The Key Contributors
Course: 10 Minutes
- The Original Key Contributors
- The Distro Companies
The Apache Software Foundation
Course: 10 Minutes
- Apache Software Foundation
- Apache Projects
- Other Apache Projects
- Other Open Source Projects
Big Data Stack
Course: 13 Minutes
- The Big Data Stack
- Big Data Components
- NoSQL Databases
Hadoop in Detail
Course: 31 Minutes
- Distributed Computing
- Design Principles of Hadoop
- Functional View of Hadoop
- HDFS in Action
- Yarn in Action
- MapReduce in Action
- Spark in Action
Practice: Big Data elements and functions
Course: 15 Minutes
- Exercise: Working with Big Data Elements
Big Data Opportunities and Challenges
Big Data Teams
Course: 28 Minutes
- Course Introduction
- The Big Data Team
- Business Team Members
- Analytics Team Members
- Data Solutions Team Members
- Cluster Team Members
- Big Data Impacting IT
Big Data Projects
Course: 25 Minutes
- DIY Supercomputing
- Hadoop in the Clouds
- Big Data and Data Warehouses
- Business Case for Big Data
- Big Data and RDBMS
- Data Center Projects
Big Data Use Cases
Course: 20 Minutes
- Data Analytics
- Big Data Engines
- Common Analytics Use Cases
- Big Data Impacting the Globe
Opportunities and Challenges
Course: 32 Minutes
- Global Increasing Digital Volume
- The Big Companies
- Big Data Opportunity
- Big Data Challenges
- Challenges of Security and Privacy
- Planning for Big Data
- Big Data Impacting Business
- Practice: Challenges and Opportunities of Big Data
- Exercise: Challenges and Opportunities of Big Data
Big Data Concepts: Getting to Know Big Data
Course: 43 Minutes
- Course Overview
- What Is Big Data?
- Sources of Big Data
- Characteristics of Big Data
- Structured and Unstructured Data
- Big Data Analytics
- Advantages of Big Data Analytics
- Big Data Analytics: Domain Use Cases
- Big Data Analytics: Netflix Use Case
- Big Data Analytics: Amazon Use Case
- Major Challenges in Big Data
- Course Summary
Big Data Concepts: Big Data Essentials
Course: 46 Minutes
- Course Overview
- Raw Data and Big Data
- Data Warehousing and Big Data
- Big Data Computing Systems
- Horizontal and Vertical Scaling
- Features, Benefits, and Use Cases of Hadoop
- Hadoop: Components
- Hadoop: Migration to the Cloud
- Hadoop and Cloud Computing
- Features of Big Data Storage Systems
- In-memory Storage Systems
- Course Summary
Non-relational Data: Non-relational Databases
Course: 52 Minutes
- Course Overview
- Non-relational Databases
- The NoSQL Approach
- Benefits of NoSQL
- Document Databases
- Key-value Data Stores
- Graph Databases
- Columnar Databases
- HBase Architecture
- Multi-model Databases
- Next Generation NewSQL Databases
- Course Summary
Big Data Analytics: Techniques for Big Data Analytics
Course: 39 Minutes
- Course Overview
- Big Data Analytics Challenges
- Big Data Analytics Stack Layers
- Big Data Ingestion
- The Data Processing Layer
- The Data Storage Layer
- Pillars of Big Data Architecture
- Batch Processing and Big Data
- Stream Processing and Big Data
- Lambda Architecture and Use Cases
- Kappa Architecture
- Course Summary
Big Data Analytics: Spark for High-speed Big Data Analytics
Course: 51 Minutes
- Course Overview
- The Core Characteristics of Apache Spark
- Components of the Apache Spark Architecture
- Apache Spark Use Case: Uber Using Spark
- Apache Spark Use Case: Alibaba Using Spark
- Apache Spark Use Case: The Healthcare Industry
- Apache Spark vs. Hadoop
- Top Apache Spark Use Cases
- Apache Spark's Main Features
- Apache Spark Performance Optimization Techniques
- Apache Spark Best Practices
- Course Summary
Harnessing Data Volume & Velocity: Big Data to Smart Data
Course: 39 Minutes
- Course Overview
- Comparing Big Data and Smart Data
- Smart Data and Edge Technologies
- Big Data to Smart Data Formation
- Smart Data and Smart Processes
- Smart Data Use Cases
- Smart Data Life Cycle
- Big Data to Smart Data Using k-NN
- Smart Data Frameworks
- Smart Data to Business
- Clustering Smart Data
- Smart Data Integration
- Exercise: Transform Big Data to Smart Data
Securing Big Data Streams
Course: 1 Hour, 3 Minutes
- Course Overview
- Big Data Security Concerns
- Streaming Data Security Concerns
- NoSQL Database Security Concerns
- Distributed Processing Security Risks
- Data Mining and Analytics Privacy Flaws
- End-Point Device Tampering Risks
- Secure Big Data
- Secure Data Streams
- Secure Data In Motion
- End-Point Input Validation and Filtering
- Secure Data at Rest with Symmetric Ciphers
- Exercise: Securing Big Data Streams
Assessment:
- Big Data Infrastructures
Emerging New Age Architectures
In this learning, the focus will be on cloud data platforms, data lakes, and modern warehouses.
Courses (5 hours +)
Cloud Data Platforms: Cloud Computing
Course: 52 Minutes
- Course Overview
- Cloud Computing and Its Characteristics
- Cloud Computing: Use Cases and Benefits
- Cloud Computing Services: Storage and Compute Power
- Types of Cloud Compute Power
- Types of Cloud Storage
- Cloud Computing Models: PaaS, IaaS, SaaS, and FaaS
- Cloud Computing Model Comparison
- Components of Cloud Computing Architectures
- Cloud Service Provider Comparison
- Cloud Elasticity and Scalability
- Course Summary
Cloud Data Platforms: Cloud-based Applications & Storage
Course: 53 Minutes
- Course Overview
- Deploying Applications on Cloud Platforms
- Characteristics of Cloud-ready Applications
- Types of Cloud Deployment Models
- Cloud Deployment Tools
- Considerations for Cloud Application Deployment
- CPU Virtualization, Memory, and I/O Devices
- Cloud Storage Platforms
- Cloud Storage Technologies
- HDFS and Amazon S
- Types of Data Centers
- Course Summary
Cloud Data Platforms: AWS, Azure, & GCP Comparison
Course: 56 Minutes
- Course Overview
- Cloud Data Platforms: Amazon Web Services
- Cloud Data Platforms: Microsoft Azure
- Cloud Data Platforms: Google Cloud Platform
- Cloud Analytics
- Popular Cloud Analytics Tools
- Cloud Computing Challenges: Security
- Cloud Computing Challenges: Compliance
- Cloud Computing Challenges: Cost Management
- Cloud Computing Challenges: Governance
- Future of Cloud Computing
- Course Summary
Data Lakes and Modern Data Warehouses: Data Lakes
Course: 1 Hour, 19 Minutes
- Course Overview
- Data Lake Evolution
- Modern Data Lake Architecture
- Data Lakes: Key Concepts
- Data Lake Maturity Stages
- Data Swamps
- Data Lake Platforms
- Data Lake Platforms
- Governed Data Lakes
- Data Lakes: Risks and Challenges
- Data Lakes vs. Data Warehouses
- Course Summary
Data Lakes and Modern Data Warehouses: Modern Data Warehouses
Course: 1 Hour, 10 Minutes
- Course Overview
- Data Warehouses and Its Characteristics
- Modern Data Warehouses: Key Concepts and Stages
- Amazon Redshift
- Google BigQuery
- Modern Data Warehouses: Architecture and Processes
- Modern Data Warehouses: Techniques
- Data Warehouse Solutions: Batch Processing
- Data Warehouse Solutions: Real-time Processing
- Data Warehouse Solutions: Streaming Analytics
- Hybrid Modern Data Warehouse
- Course Summary
Data Lakes and Modern Data Warehouses: Azure Databricks & Data Pipelines
Course: 1 Hour, 2 Minutes
- Course Overview
- Azure Databricks: Features and Architecture
- Azure Databricks: Pros and Cons
- Snowflake Data Warehouses: Features and Architecture
- Snowflake Data Warehouses: Pros and Cons
- Data Pipelines
- Components of a Data Pipeline
- Advantages of a Data Pipeline
- Types of Data Pipeline Tools
- Comparing Data Pipeline Tools
- Building a Data Pipeline
- Course Summary
Assessment:
Emerging New Age Architectures
Apache Spark
Explore the basics of Apache Spark, an analytics engine used for big data processing.
Courses
Accessing Data with Spark (3 hours+)
Accessing Data with Spark: An Introduction to Spark
Course: 1 Hour, 7 Minutes
- Course Overview
- Introduction to Spark and Hadoop
- Resilient Distributed Datasets (RDDs)
- RDD Operations
- Spark DataFrames
- Spark Architecture
- Spark Installation
- Working with RDDs
- Creating DataFrames from RDDs
- Contents of a DataFrame
- The SQLContext
- The map() Function of an RDD
- Accessing the Contents of a DataFrame
- DataFrames in Spark and Pandas
- Exercise: Working with Spark
Accessing Data with Spark: Data Analysis Using the Spark DataFrame API
Course: 1 Hour, 12 Minutes
- Course Overview
- Performance Improvements in Spark
- Broadcast Variables and Accumulators
- Loading Data into a DataFrame
- Sampling the Contents of a DataFrame
- Grouping and Aggregations
- Visualizing Data in a DataFrame
- Trimming and Cleaning Data
- User-Defined Functions and DataFrames
- Combining Filters, Aggregations, and Sorting
- Using Broadcast Variables
- Using Accumulators
- Exporting DataFrame Contents
- Custom Accumulators
- Join Operations
- Exercise: Data Analysis Using the DataFrame API
Accessing Data with Spark: Data Analysis using Spark SQL
Course: 55 Minutes
- Course Overview
- The Spark Catalyst Optimizer
- Introduction to Spark SQL
- Preparing Data for Analysis
- Running SQL Queries
- Inferred and Explicit Schemas
- Windowing in Spark
- Applying Window Functions
- Exercise: Data Analysis Using Spark SQL
Big Data Development with Apache Spark (5 hours+)
Introduction to Apache Spark
Course: 1 Hour, 2 Minutes
- Course Introduction
- Overview of Apache Spark
- Downloading and Installing Apache Spark
- Downloading and Installing Apache Spark on Mac OS
- Building Spark
- Working with Spark Shell
- Linking to Spark
- Spark Configuration
- Initializing Apache Spark
- Running Spark on Clusters
Apache Spark SQL
Course: 1 Hour, 10 Minutes
- Course Introduction
- Apache Spark SQL Overview
- SparkSession
- DataFrames
- Aggregations
- SQL Queries
- Temporary View
- Datasets
- JSON Datasets
- Load/Save Functions
- Specifying a Data Source
- Querying with SQL
- SaveMode
- Parquet Files
- Persistent Tables
- Partitioning
Structured Streaming
Course: 1 Hour, 13 Minutes
- Course Introduction
- Structured Streaming Overview
- Stream Input
- Stream Output
- Windowing
- Continuous Applications
- Deduplication
- File Sinks
- Streaming Query
- Streaming Query Manager
- Checkpointing
- Word Count
Spark Monitoring and Tuning
Course: 59 Minutes
Monitoring Spark Applications
Course: 17 Minutes
- Course Introduction
- Web UI
- Environment Configuration
- REST API
- Memory Allocation
Tuning Spark Applications
Course: 38 Minutes
- Speculation
- Serialization
- Memory Tuning
- Executor Memory
- Garbage Collection Tuning
- Parallelism
- Broadcast Functionality
- Explain Query Execution
- Data Compression
Practice: Monitoring Spark Applications
Course: 4 Minutes
- Exercise: Monitor Spark Applications4
Spark Security
Course: 36 Minutes
- Course Introduction
- Spark UI
- Secure Event Logs
- SSL Settings
- Shared Secret
- YARN Deployments
- SASL Encryption
- Network Security
Practice: Configuring Spark Security
Course: 3 Minutes
- Exercise: Configure Spark Security
Practice Lab:
Developing with Apache Spark (5 hours)
Practice developing with Apache Spark by performing tasks with Spark SQL, Spark Streaming, and GraphX. Then create a classification system using MLib and work with MLib Regression.
Apache Hadoop
Apache Hadoop is an open-source framework for the storage and processing of big data.
Courses
Getting Started with Hadoop (5 hours+)
Introduction to Apache Spark
Course: 1 Hour, 2 Minutes
- Course Introduction
- Overview of Apache Spark
- Downloading and Installing Apache Spark
- Downloading and Installing Apache Spark on Mac OS
- Building Spark
- Working with Spark Shell
- Linking to Spark
- Spark Configuration
- Initializing Apache Spark
- Running Spark on Clusters
Apache Spark SQL
Course: 1 Hour, 10 Minutes
- Course Introduction
- Apache Spark SQL Overview
- SparkSession
- DataFrames
- Aggregations
- SQL Queries
- Temporary View
- Datasets
- JSON Datasets
- Load/Save Functions
- Specifying a Data Source
- Querying with SQL
- SaveMode
- Parquet Files
- Persistent Tables
- Partitioning
Structured Streaming
Course: 1 Hour, 13 Minutes
- Course Introduction
- Structured Streaming Overview
- Stream Input
- Stream Output
- Windowing
- Continuous Applications
- Deduplication
- File Sinks
- Streaming Query
- Streaming Query Manager
- Checkpointing
- Word Count
Spark Monitoring and Tuning
Course: 59 Minutes
Monitoring Spark Applications
Course: 17 Minutes
- Course Introduction
- Web UI
- Environment Configuration
- REST API
- Memory Allocation
Tuning Spark Applications
Course: 38 Minutes
- Speculation
- Serialization
- Memory Tuning
- Executor Memory
- Garbage Collection Tuning
- Parallelism
- Broadcast Functionality
- Explain Query Execution
- Data Compression
Practice: Monitoring Spark Applications
Course: 4 Minutes
- Exercise: Monitor Spark Applications
Spark Security
Course: 36 Minutes
- Course Introduction
- Spark UI
- Secure Event Logs
- SSL Settings
- Shared Secret
- YARN Deployments
- SASL Encryption
- Network Security
Practice: Configuring Spark Security
Course: 3 Minutes
- Exercise: Configure Spark Security
Working with Hadoop HDFS (3 hours+)
Hadoop HDFS: Introduction
Course: 1 Hour, 15 Minutes
- Course Overview
- Scaling Datasets
- Horizontal Scaling for Big Data
- Distributed Clusters and Horizontal Scaling
- Overview of HDFS
- HDFS Architectures
- MapReduce for HDFS
- YARN for HDFS
- The Mechanism of Resource Allocation in Hadoop
- Apache Zookeeper for HDFS
- The Hadoop Ecosystem
- Exercise: An Introduction to HDFS
Hadoop HDFS: Introduction to the Shell
Course: 53 Minutes
- Course Overview
- Creating a Hadoop Cluster on the Google Cloud
- Exploring Hadoop Clusters
- The YARN Cluster Manager UI
- The HDFS NameNode UIs
- Browsing the Packaged Hadoop Tools
- Configuring HDFS
- The HDFS Shells
- Exercise: Introduction to the HDFS Shell
Hadoop HDFS: Working with Files
Course: 48 Minutes
- Course Overview
Basic Directory Commands in HDFS - Using the copyFromLocal Command in HDFS
- Using the put Command in HDFS
- Using the copyToLocal Command in HDFS
- Retrieving files from HDFS
- Append and Delete Operations in HDFS
- Exercise: Working with Files on HDFS
Hadoop HDFS: File Permissions
Course: 49 Minutes
- Course Overview
- The HDFS count and du Commands
- Viewing and Setting File Permissions in HDFS
- Applying Permissions Recursively in HDFS
- An Introduction to Bash Scripting
- Scripting HDFS Operations
- Exploring the HDFS NameNode UI
- Cleanup Operations in HDFS
Data Warehousing with Hadoop (4 hours+)
Data Warehousing with Hadoop: Managing Big Data Using HDInsight Hadoop
Course: 1 Hour, 6 Minutes
- Features of HDInsight
- Fundamentals and Types of Clusters in HDInsight
- Essential Opensource Components of HDInsight
- Setting Up Hadoop Clusters on Azure HDInsight
- HDInsight Clusters with Resource Manager Template
- HDInsight Services and Storage Types
- Azure Management Console
- Creating and Managing HDInsight Clusters
- Setting Up HDInsight Emulator
- Programming in HDInsight
- Developing and Executing MapReduce Program
- Exercise: Working with HDInsight and MapReduce
Data Warehousing with Hadoop: Microsoft Analytics Platform System and Hive
Course: 1 Hour, 29 Minutes
- Microsoft Analytics Platform System
- Understanding PolyBase
- Parallel Data Warehouse Architecture
- Data Exploration Architectures
- Hive Introduction
- Hive Architecture in HDInsight
- Setting up the Development Environment for Hive
- Connect and Submit Queries
- Hive QL
- Using Azure PowerShell and Beeline
- Creating a Database and Tables and Loading Data
- Partition Tables and Data Formats
- Hue Installation and Hive Query Management
- Using Microsoft BI and Hive
- Hive as ETL
- HBase and Hive
- Exercise: Creating and Loading Data into Hive Tables
Data Warehousing with Hadoop: HDInsight and Retail Sales Implementation Using Hive
Course: 46 Minutes
- Data Modeling
- Dimensional Design Process
- Dimensional Design Steps
- Retail Business Use Cases
- Dimension Tables
- Fact tables
- Data Loading in Dimension and Fact Tables
- Essential Queries
- Creating and Executing Queries
- Hive and Power BI for Visualization
Data Warehousing with Hadoop: Spark, HDInsight and Cluster Management
Course: 56 Minutes
- Spark Introduction
- Data Representation in Spark
- Create Spark Clusters Using PowerShell
- Spark SQL and Hive
- Spark SQL Data Sources and DataFrames
- Customizing HDInsight Cluster
- Application Installation on HDInsight
- Ambari User Management
- HDInsight Management Using Azure CLI
- Troubleshooting HDInsight
- Monitoring HDInsight Hadoop
- Exercise: Working with Spark and Ambari
Taal | Engels |
---|---|
Kwalificaties van de Instructeur | Gecertificeerd |
Cursusformaat en Lengte | Lesvideo's met ondertiteling, interactieve elementen en opdrachten en testen |
Lesduur | 25 uur |
Assesments | De assessment test uw kennis en toepassingsvaardigheden van de onderwerpen uit het leertraject. Deze is 365 dagen beschikbaar na activering. |
Online Virtuele labs | Ontvang 12 maanden toegang tot virtuele labs die overeenkomen met de traditionele cursusconfiguratie. Actief voor 365 dagen na activering, beschikbaarheid varieert per Training. |
Online mentor | U heeft 24/7 toegang tot een online mentor voor al uw specifieke technische vragen over het studieonderwerp. De online mentor is 365 dagen beschikbaar na activering, afhankelijk van de gekozen Learning Kit. |
Voortgangsbewaking | Ja |
Toegang tot Materiaal | 365 dagen |
Technische Vereisten | Computer of mobiel apparaat, Stabiele internetverbindingen Webbrowserzoals Chrome, Firefox, Safari of Edge. |
Support of Ondersteuning | Helpdesk en online kennisbank 24/7 |
Certificering | Certificaat van deelname in PDF formaat |
Prijs en Kosten | Cursusprijs zonder extra kosten |
Annuleringsbeleid en Geld-Terug-Garantie | Wij beoordelen dit per situatie |
Award Winning E-learning | Ja |
Tip! | Zorg voor een rustige leeromgeving, tijd en motivatie, audioapparatuur zoals een koptelefoon of luidsprekers voor audio, accountinformatie zoals inloggegevens voor toegang tot het e-learning platform. |
Er zijn nog geen reviews geschreven over dit product.
OEM Office Elearning Menu Genomineerd voor 'Beste Opleider van Nederland'
OEM Office Elearning Menu is trots genomineerd te zijn voor de titel 'Beste Opleider van Nederland' door Springest, een onderdeel van Archipel. Deze erkenning bevestigt onze kwaliteit en toewijding. Hartelijk dank aan al onze cursisten.
Beoordelingen
Er zijn nog geen reviews geschreven over dit product.