Big Data Concept + Tools + Techniques E-Learning Training Online Certified Teachers Quizzes Assessments Test Exam Live Labs Tips Tricks Certificate.
Read more.
Bulk discount
No discount
1 Piece
€481,58€398,00
2% Discount
2 Pieces
€471,95€390,04/ Piece
3% Discount
3 Pieces
€467,13€386,06/ Piece
4% Discount
4 Pieces
€462,32€382,08/ Piece
5% Discount
5 Pieces
€457,50€378,10/ Piece
10% Discount
10 Pieces
€433,42€358,20/ Piece
15% Discount
25 Pieces
€409,34€338,30/ Piece
20% Discount
50 Pieces
€385,26€318,40/ Piece
Make a choice
Officieel examen Online of fysiek
Start nu – bekroonde e-learning Inclusief proefexamens & 24/7
ISO 9001 & 27001 werkwijze 1000+ organisaties gingen u voor
Maatwerk & gratis intake Inclusief nulmeting bij training
Product description
Big Data Concept + Tools + Techniques E-Learning
Master the fundamentals of Big Data to unlock insights in a data-driven world.
In today’s fast-paced digital world, the amount of data being created is growing exponentially. Traditional data management techniques are no longer sufficient. This training introduces you to the foundational concepts of Big Data and equips you with the tools and techniques to capture, store, and analyze massive datasets. You’ll explore leading frameworks like Hadoop and Spark, and learn to design pipelines that power modern data infrastructure.
What you’ll learn:
Core Big Data concepts and principles
Key tools like Hadoop, Spark, Hive, and more
Cloud-based Big Data processing strategies
Designing scalable ETL pipelines
Real-world applications across industries
This course is part of an Agile Learning Kit, including e-learning, assessments, mentoring, labs, and 365-day access to all resources.
Why Choose This Training?
Build a solid foundation in Big Data architecture and tools
Get hands-on with industry-standard platforms and frameworks
Understand data processing in distributed cloud environments
Future-proof your career in the analytics and data space
Access all learning materials and mentorship for 365 days
Who Should Enroll?
This course is ideal for:
Data analysts and engineers entering the Big Data space
IT professionals managing or processing large data volumes
Students or career switchers exploring data roles
BI professionals leveraging modern data ecosystems
This Learning Kit with more than 25 hours of learning is divided into three tracks:
Demo Big Data Concept + Tools + Techniques Training
Course content
Big Data Infrastructures
In this learning, the focus will be on big data concepts, non-relational data, and big data analytics.
Courses (7 hours +)
The Big Data Technology Wave
Big Data in Perspective
Course: 17 Minutes
Course Introduction
Introducing Big Data
The Biggest Wave Yet
Emerging Technologies
Global Data
Course:14 Minutes
Defining Big Data
Key Terms for Data
Sizing Big Data
The Key Contributors
Course: 10 Minutes
The Original Key Contributors
The Distro Companies
The Apache Software Foundation
Course: 10 Minutes
Apache Software Foundation
Apache Projects
Other Apache Projects
Other Open Source Projects
Big Data Stack
Course: 13 Minutes
The Big Data Stack
Big Data Components
NoSQL Databases
Hadoop in Detail
Course: 31 Minutes
Distributed Computing
Design Principles of Hadoop
Functional View of Hadoop
HDFS in Action
Yarn in Action
MapReduce in Action
Spark in Action
Practice: Big Data elements and functions
Course: 15 Minutes
Exercise: Working with Big Data Elements
Big Data Opportunities and Challenges
Big Data Teams
Course: 28 Minutes
Course Introduction
The Big Data Team
Business Team Members
Analytics Team Members
Data Solutions Team Members
Cluster Team Members
Big Data Impacting IT
Big Data Projects
Course: 25 Minutes
DIY Supercomputing
Hadoop in the Clouds
Big Data and Data Warehouses
Business Case for Big Data
Big Data and RDBMS
Data Center Projects
Big Data Use Cases
Course: 20 Minutes
Data Analytics
Big Data Engines
Common Analytics Use Cases
Big Data Impacting the Globe
Opportunities and Challenges
Course: 32 Minutes
Global Increasing Digital Volume
The Big Companies
Big Data Opportunity
Big Data Challenges
Challenges of Security and Privacy
Planning for Big Data
Big Data Impacting Business
Practice: Challenges and Opportunities of Big Data
Exercise: Challenges and Opportunities of Big Data
Big Data Concepts: Getting to Know Big Data
Course: 43 Minutes
Course Overview
What Is Big Data?
Sources of Big Data
Characteristics of Big Data
Structured and Unstructured Data
Big Data Analytics
Advantages of Big Data Analytics
Big Data Analytics: Domain Use Cases
Big Data Analytics: Netflix Use Case
Big Data Analytics: Amazon Use Case
Major Challenges in Big Data
Course Summary
Big Data Concepts: Big Data Essentials
Course: 46 Minutes
Course Overview
Raw Data and Big Data
Data Warehousing and Big Data
Big Data Computing Systems
Horizontal and Vertical Scaling
Features, Benefits, and Use Cases of Hadoop
Hadoop: Components
Hadoop: Migration to the Cloud
Hadoop and Cloud Computing
Features of Big Data Storage Systems
In-memory Storage Systems
Course Summary
Non-relational Data: Non-relational Databases
Course: 52 Minutes
Course Overview
Non-relational Databases
The NoSQL Approach
Benefits of NoSQL
Document Databases
Key-value Data Stores
Graph Databases
Columnar Databases
HBase Architecture
Multi-model Databases
Next Generation NewSQL Databases
Course Summary
Big Data Analytics: Techniques for Big Data Analytics
Course: 39 Minutes
Course Overview
Big Data Analytics Challenges
Big Data Analytics Stack Layers
Big Data Ingestion
The Data Processing Layer
The Data Storage Layer
Pillars of Big Data Architecture
Batch Processing and Big Data
Stream Processing and Big Data
Lambda Architecture and Use Cases
Kappa Architecture
Course Summary
Big Data Analytics: Spark for High-speed Big Data Analytics
Course: 51 Minutes
Course Overview
The Core Characteristics of Apache Spark
Components of the Apache Spark Architecture
Apache Spark Use Case: Uber Using Spark
Apache Spark Use Case: Alibaba Using Spark
Apache Spark Use Case: The Healthcare Industry
Apache Spark vs. Hadoop
Top Apache Spark Use Cases
Apache Spark's Main Features
Apache Spark Performance Optimization Techniques
Apache Spark Best Practices
Course Summary
Harnessing Data Volume & Velocity: Big Data to Smart Data
Course: 39 Minutes
Course Overview
Comparing Big Data and Smart Data
Smart Data and Edge Technologies
Big Data to Smart Data Formation
Smart Data and Smart Processes
Smart Data Use Cases
Smart Data Life Cycle
Big Data to Smart Data Using k-NN
Smart Data Frameworks
Smart Data to Business
Clustering Smart Data
Smart Data Integration
Exercise: Transform Big Data to Smart Data
Securing Big Data Streams
Course: 1 Hour, 3 Minutes
Course Overview
Big Data Security Concerns
Streaming Data Security Concerns
NoSQL Database Security Concerns
Distributed Processing Security Risks
Data Mining and Analytics Privacy Flaws
End-Point Device Tampering Risks
Secure Big Data
Secure Data Streams
Secure Data In Motion
End-Point Input Validation and Filtering
Secure Data at Rest with Symmetric Ciphers
Exercise: Securing Big Data Streams
Assessment:
Big Data Infrastructures
Emerging New Age Architectures
In this learning, the focus will be on cloud data platforms, data lakes, and modern warehouses.
Courses (5 hours +)
Cloud Data Platforms: Cloud Computing
Course: 52 Minutes
Course Overview
Cloud Computing and Its Characteristics
Cloud Computing: Use Cases and Benefits
Cloud Computing Services: Storage and Compute Power
Types of Cloud Compute Power
Types of Cloud Storage
Cloud Computing Models: PaaS, IaaS, SaaS, and FaaS
Cloud Computing Model Comparison
Components of Cloud Computing Architectures
Cloud Service Provider Comparison
Cloud Elasticity and Scalability
Course Summary
Cloud Data Platforms: Cloud-based Applications & Storage
Course: 53 Minutes
Course Overview
Deploying Applications on Cloud Platforms
Characteristics of Cloud-ready Applications
Types of Cloud Deployment Models
Cloud Deployment Tools
Considerations for Cloud Application Deployment
CPU Virtualization, Memory, and I/O Devices
Cloud Storage Platforms
Cloud Storage Technologies
HDFS and Amazon S
Types of Data Centers
Course Summary
Cloud Data Platforms: AWS, Azure, & GCP Comparison
Course: 56 Minutes
Course Overview
Cloud Data Platforms: Amazon Web Services
Cloud Data Platforms: Microsoft Azure
Cloud Data Platforms: Google Cloud Platform
Cloud Analytics
Popular Cloud Analytics Tools
Cloud Computing Challenges: Security
Cloud Computing Challenges: Compliance
Cloud Computing Challenges: Cost Management
Cloud Computing Challenges: Governance
Future of Cloud Computing
Course Summary
Data Lakes and Modern Data Warehouses: Data Lakes
Course: 1 Hour, 19 Minutes
Course Overview
Data Lake Evolution
Modern Data Lake Architecture
Data Lakes: Key Concepts
Data Lake Maturity Stages
Data Swamps
Data Lake Platforms
Data Lake Platforms
Governed Data Lakes
Data Lakes: Risks and Challenges
Data Lakes vs. Data Warehouses
Course Summary
Data Lakes and Modern Data Warehouses: Modern Data Warehouses
Course: 1 Hour, 10 Minutes
Course Overview
Data Warehouses and Its Characteristics
Modern Data Warehouses: Key Concepts and Stages
Amazon Redshift
Google BigQuery
Modern Data Warehouses: Architecture and Processes
Modern Data Warehouses: Techniques
Data Warehouse Solutions: Batch Processing
Data Warehouse Solutions: Real-time Processing
Data Warehouse Solutions: Streaming Analytics
Hybrid Modern Data Warehouse
Course Summary
Data Lakes and Modern Data Warehouses: Azure Databricks & Data Pipelines
Course: 1 Hour, 2 Minutes
Course Overview
Azure Databricks: Features and Architecture
Azure Databricks: Pros and Cons
Snowflake Data Warehouses: Features and Architecture
Snowflake Data Warehouses: Pros and Cons
Data Pipelines
Components of a Data Pipeline
Advantages of a Data Pipeline
Types of Data Pipeline Tools
Comparing Data Pipeline Tools
Building a Data Pipeline
Course Summary
Assessment:
Emerging New Age Architectures
Apache Spark
Explore the basics of Apache Spark, an analytics engine used for big data processing.
Courses
Accessing Data with Spark (3 hours+)
Accessing Data with Spark: An Introduction to Spark
Course: 1 Hour, 7 Minutes
Course Overview
Introduction to Spark and Hadoop
Resilient Distributed Datasets (RDDs)
RDD Operations
Spark DataFrames
Spark Architecture
Spark Installation
Working with RDDs
Creating DataFrames from RDDs
Contents of a DataFrame
The SQLContext
The map() Function of an RDD
Accessing the Contents of a DataFrame
DataFrames in Spark and Pandas
Exercise: Working with Spark
Accessing Data with Spark: Data Analysis Using the Spark DataFrame API
Course: 1 Hour, 12 Minutes
Course Overview
Performance Improvements in Spark
Broadcast Variables and Accumulators
Loading Data into a DataFrame
Sampling the Contents of a DataFrame
Grouping and Aggregations
Visualizing Data in a DataFrame
Trimming and Cleaning Data
User-Defined Functions and DataFrames
Combining Filters, Aggregations, and Sorting
Using Broadcast Variables
Using Accumulators
Exporting DataFrame Contents
Custom Accumulators
Join Operations
Exercise: Data Analysis Using the DataFrame API
Accessing Data with Spark: Data Analysis using Spark SQL
Course: 55 Minutes
Course Overview
The Spark Catalyst Optimizer
Introduction to Spark SQL
Preparing Data for Analysis
Running SQL Queries
Inferred and Explicit Schemas
Windowing in Spark
Applying Window Functions
Exercise: Data Analysis Using Spark SQL
Big Data Development with Apache Spark (5 hours+)
Introduction to Apache Spark
Course: 1 Hour, 2 Minutes
Course Introduction
Overview of Apache Spark
Downloading and Installing Apache Spark
Downloading and Installing Apache Spark on Mac OS
Building Spark
Working with Spark Shell
Linking to Spark
Spark Configuration
Initializing Apache Spark
Running Spark on Clusters
Apache Spark SQL
Course: 1 Hour, 10 Minutes
Course Introduction
Apache Spark SQL Overview
SparkSession
DataFrames
Aggregations
SQL Queries
Temporary View
Datasets
JSON Datasets
Load/Save Functions
Specifying a Data Source
Querying with SQL
SaveMode
Parquet Files
Persistent Tables
Partitioning
Structured Streaming
Course: 1 Hour, 13 Minutes
Course Introduction
Structured Streaming Overview
Stream Input
Stream Output
Windowing
Continuous Applications
Deduplication
File Sinks
Streaming Query
Streaming Query Manager
Checkpointing
Word Count
Spark Monitoring and Tuning
Course: 59 Minutes
Monitoring Spark Applications
Course: 17 Minutes
Course Introduction
Web UI
Environment Configuration
REST API
Memory Allocation
Tuning Spark Applications
Course: 38 Minutes
Speculation
Serialization
Memory Tuning
Executor Memory
Garbage Collection Tuning
Parallelism
Broadcast Functionality
Explain Query Execution
Data Compression
Practice: Monitoring Spark Applications
Course: 4 Minutes
Exercise: Monitor Spark Applications4
Spark Security
Course: 36 Minutes
Course Introduction
Spark UI
Secure Event Logs
SSL Settings
Shared Secret
YARN Deployments
SASL Encryption
Network Security
Practice: Configuring Spark Security
Course: 3 Minutes
Exercise: Configure Spark Security
Practice Lab: Developing with Apache Spark (5 hours)
Practice developing with Apache Spark by performing tasks with Spark SQL, Spark Streaming, and GraphX. Then create a classification system using MLib and work with MLib Regression.
Apache Hadoop Apache Hadoop is an open-source framework for the storage and processing of big data.
Courses
Getting Started with Hadoop (5 hours+)
Introduction to Apache Spark
Course: 1 Hour, 2 Minutes
Course Introduction
Overview of Apache Spark
Downloading and Installing Apache Spark
Downloading and Installing Apache Spark on Mac OS
Building Spark
Working with Spark Shell
Linking to Spark
Spark Configuration
Initializing Apache Spark
Running Spark on Clusters
Apache Spark SQL
Course: 1 Hour, 10 Minutes
Course Introduction
Apache Spark SQL Overview
SparkSession
DataFrames
Aggregations
SQL Queries
Temporary View
Datasets
JSON Datasets
Load/Save Functions
Specifying a Data Source
Querying with SQL
SaveMode
Parquet Files
Persistent Tables
Partitioning
Structured Streaming
Course: 1 Hour, 13 Minutes
Course Introduction
Structured Streaming Overview
Stream Input
Stream Output
Windowing
Continuous Applications
Deduplication
File Sinks
Streaming Query
Streaming Query Manager
Checkpointing
Word Count
Spark Monitoring and Tuning
Course: 59 Minutes
Monitoring Spark Applications
Course: 17 Minutes
Course Introduction
Web UI
Environment Configuration
REST API
Memory Allocation
Tuning Spark Applications
Course: 38 Minutes
Speculation
Serialization
Memory Tuning
Executor Memory
Garbage Collection Tuning
Parallelism
Broadcast Functionality
Explain Query Execution
Data Compression
Practice: Monitoring Spark Applications
Course: 4 Minutes
Exercise: Monitor Spark Applications
Spark Security
Course: 36 Minutes
Course Introduction
Spark UI
Secure Event Logs
SSL Settings
Shared Secret
YARN Deployments
SASL Encryption
Network Security
Practice: Configuring Spark Security
Course: 3 Minutes
Exercise: Configure Spark Security
Working with Hadoop HDFS (3 hours+)
Hadoop HDFS: Introduction
Course: 1 Hour, 15 Minutes
Course Overview
Scaling Datasets
Horizontal Scaling for Big Data
Distributed Clusters and Horizontal Scaling
Overview of HDFS
HDFS Architectures
MapReduce for HDFS
YARN for HDFS
The Mechanism of Resource Allocation in Hadoop
Apache Zookeeper for HDFS
The Hadoop Ecosystem
Exercise: An Introduction to HDFS
Hadoop HDFS: Introduction to the Shell
Course: 53 Minutes
Course Overview
Creating a Hadoop Cluster on the Google Cloud
Exploring Hadoop Clusters
The YARN Cluster Manager UI
The HDFS NameNode UIs
Browsing the Packaged Hadoop Tools
Configuring HDFS
The HDFS Shells
Exercise: Introduction to the HDFS Shell
Hadoop HDFS: Working with Files
Course: 48 Minutes
Course Overview Basic Directory Commands in HDFS
Using the copyFromLocal Command in HDFS
Using the put Command in HDFS
Using the copyToLocal Command in HDFS
Retrieving files from HDFS
Append and Delete Operations in HDFS
Exercise: Working with Files on HDFS
Hadoop HDFS: File Permissions
Course: 49 Minutes
Course Overview
The HDFS count and du Commands
Viewing and Setting File Permissions in HDFS
Applying Permissions Recursively in HDFS
An Introduction to Bash Scripting
Scripting HDFS Operations
Exploring the HDFS NameNode UI
Cleanup Operations in HDFS
Data Warehousing with Hadoop (4 hours+)
Data Warehousing with Hadoop: Managing Big Data Using HDInsight Hadoop
Course: 1 Hour, 6 Minutes
Features of HDInsight
Fundamentals and Types of Clusters in HDInsight
Essential Opensource Components of HDInsight
Setting Up Hadoop Clusters on Azure HDInsight
HDInsight Clusters with Resource Manager Template
HDInsight Services and Storage Types
Azure Management Console
Creating and Managing HDInsight Clusters
Setting Up HDInsight Emulator
Programming in HDInsight
Developing and Executing MapReduce Program
Exercise: Working with HDInsight and MapReduce
Data Warehousing with Hadoop: Microsoft Analytics Platform System and Hive
Course: 1 Hour, 29 Minutes
Microsoft Analytics Platform System
Understanding PolyBase
Parallel Data Warehouse Architecture
Data Exploration Architectures
Hive Introduction
Hive Architecture in HDInsight
Setting up the Development Environment for Hive
Connect and Submit Queries
Hive QL
Using Azure PowerShell and Beeline
Creating a Database and Tables and Loading Data
Partition Tables and Data Formats
Hue Installation and Hive Query Management
Using Microsoft BI and Hive
Hive as ETL
HBase and Hive
Exercise: Creating and Loading Data into Hive Tables
Data Warehousing with Hadoop: HDInsight and Retail Sales Implementation Using Hive
Course: 46 Minutes
Data Modeling
Dimensional Design Process
Dimensional Design Steps
Retail Business Use Cases
Dimension Tables
Fact tables
Data Loading in Dimension and Fact Tables
Essential Queries
Creating and Executing Queries
Hive and Power BI for Visualization
Data Warehousing with Hadoop: Spark, HDInsight and Cluster Management
Course: 56 Minutes
Spark Introduction
Data Representation in Spark
Create Spark Clusters Using PowerShell
Spark SQL and Hive
Spark SQL Data Sources and DataFrames
Customizing HDInsight Cluster
Application Installation on HDInsight
Ambari User Management
HDInsight Management Using Azure CLI
Troubleshooting HDInsight
Monitoring HDInsight Hadoop
Exercise: Working with Spark and Ambari
Specifications
Article number
132047804
SKU
132047804
Language
English
Qualifications of the Instructor
Certified
Course Format and Length
Teaching videos with subtitles, interactive elements and assignments and tests
Lesson duration
25 Hours
Assesments
The assessment tests your knowledge and application skills of the topics in the learning pathway. It is available 365 days after activation.
Online Virtuele labs
Receive 12 months of access to virtual labs corresponding to traditional course configuration. Active for 365 days after activation, availability varies by Training
Online mentor
You will have 24/7 access to an online mentor for all your specific technical questions on the study topic. The online mentor is available 365 days after activation, depending on the chosen Learning Kit.
Progress monitoring
Access to Material
365 days
Technical Requirements
Computer or mobile device, Stable internet connections Web browsersuch as Chrome, Firefox, Safari or Edge.
Support or Assistance
Helpdesk and online knowledge base 24/7
Certification
Certificate of participation in PDF format
Price and costs
Course price at no extra cost
Cancellation policy and money-back guarantee
We assess this on a case-by-case basis
Award Winning E-learning
Tip!
Provide a quiet learning environment, time and motivation, audio equipment such as headphones or speakers for audio, account information such as login details to access the e-learning platform.
Big Data Concept + Tools + Techniques E-Learning Training Online Certified Teach...
€481,58€398,00
Specifications
Article number
132047804
SKU
132047804
Language
English
Qualifications of the Instructor
Certified
Course Format and Length
Teaching videos with subtitles, interactive elements and assignments and tests
Lesson duration
25 Hours
Assesments
The assessment tests your knowledge and application skills of the topics in the learning pathway. It is available 365 days after activation.
Online Virtuele labs
Receive 12 months of access to virtual labs corresponding to traditional course configuration. Active for 365 days after activation, availability varies by Training
Online mentor
You will have 24/7 access to an online mentor for all your specific technical questions on the study topic. The online mentor is available 365 days after activation, depending on the chosen Learning Kit.
Progress monitoring
Access to Material
365 days
Technical Requirements
Computer or mobile device, Stable internet connections Web browsersuch as Chrome, Firefox, Safari or Edge.
Support or Assistance
Helpdesk and online knowledge base 24/7
Certification
Certificate of participation in PDF format
Price and costs
Course price at no extra cost
Cancellation policy and money-back guarantee
We assess this on a case-by-case basis
Award Winning E-learning
Tip!
Provide a quiet learning environment, time and motivation, audio equipment such as headphones or speakers for audio, account information such as login details to access the e-learning platform.
Wij gebruiken functionele en analytische cookies om onze website goed te laten werken en het gebruik ervan te meten met Google Analytics. Er worden geen persoonsgegevens gedeeld voor advertentiedoeleinden. Door op "Accepteren" te klikken, geeft u toestemming voor het plaatsen van deze cookies.
Manage cookies