Skip to main content
T-MDMDP-IOfficial Google Curriculum

Managing a Data Mesh with Dataplex

2 daysILTIntermediateLoading...

Overview

Dataplex is an intelligent data fabric that enables organizations to centrally discover, manage, monitor, and govern their data across data lakes, data warehouses, and data marts. You can use Dataplex to build a data mesh architecture to decentralize data ownership among domain data owners. In this course, you will learn how to discover, manage, monitor, and govern your data across data lakes, data warehouses, and data marts through guided lectures and independent exercises using sample data.

What You'll Learn

  • Identify the importance of a modern data platform
  • Configure and set up Dataplex
  • Secure data lakes, zones, and assets
  • Implement tagging for resources and use tags to search for assets
  • Process data using Dataplex tasks
  • Design, execute and report on data quality processes

Who Should Attend

Customers

Prerequisites

Completion of the "Modernizing Data Lakes and Data Warehouses with Google Cloud" and "Building Batch Data Pipelines on Google Cloud" courses in the "Data Engineer" learning path or equivalent experience using Google Cloud.

Products Covered

DataplexCloud StorageBigQueryDataflow

Course Modules

1

Introduction to Dataplex

Topics

  • Modern Data Platforms and Data-Oriented Design
  • Pillars of Data Governance
  • What is Dataplex?
  • Dataplex Capabilities
  • Dataplex compared with other products on Google Cloud

Learning Outcomes

  • Identify the importance of a modern data platform
  • Explain the role of Dataplex on Google Cloud
2

Creating a Data Mesh on Dataplex

Topics

  • What is a data mesh?
  • Dataplex concepts
  • Creating data lakes and zones
  • Assets in Dataplex

Learning Outcomes

  • Define key Dataplex concepts
  • Configure and set up Dataplex

Activities

Lab: Provision a Data Mesh using Dataplex
3

Processing Data on Dataplex

Topics

  • Processing data on Dataplex
  • Data preparation tasks
  • Ingestion jobs
  • Dataflow and Spark tasks

Learning Outcomes

  • Understand different data processing options in Dataplex
  • Configure and run data preparation tasks on Dataplex

Activities

Lab: Standardize Data using Dataplex Tasks
4

Managing Data Security through Dataplex

Topics

  • IAM permissions and roles
  • Securing your data lake
  • Policy management
  • Metadata security

Learning Outcomes

  • Secure data lakes, zones, and assets in Dataplex

Activities

Lab: Manage Data Security using Dataplex
5

Data Tagging and Data Catalog

Topics

  • Introduction to Data Catalog
  • Technical metadata vs. business metadata
  • Tags and tag templates
  • Entries and entry groups
  • Data lineage

Learning Outcomes

  • Implement tagging for resources and use tags to search for assets

Activities

Lab: Data Catalog and Data Lineage
6

Data Quality and Profiling

Topics

  • Data quality tasks and AutoDQ
  • Reporting on data quality
  • Data profiling

Learning Outcomes

  • Design, execute and report on data quality processes

Activities

Lab: Data Quality and Profiling your Data in BigQuery
7

Dataplex Best Practices

Topics

  • Best practices
  • End-to-end demo

Learning Outcomes

  • Implement best practices for Dataplex

Activities

Challenge Lab: Managing a Data Mesh with Dataplex

What's Not Covered

  • This course does not cover the interaction of Dataplex with Dataproc Metastore nor does it do a deep dive into BigLake concepts

Get This Training

No public classes currently scheduled. Express interest below or request private training.

Request Private Session

Course Details

Course Code
T-MDMDP-I
Duration
2 days
Format
ILT
Level
Intermediate
Modules
7
Activities
6
Price
Loading...
View Official Google Datasheet →

Questions About This Course?

Contact us for custom scheduling, group discounts, or curriculum customization.

Contact Us
Starting fromLoading...