







































































Study with the several resources on Docsity
Earn points by helping other students or get them with a premium plan
Prepare for your exams
Study with the several resources on Docsity
Earn points to download
Earn points by helping other students or get them with a premium plan
A comprehensive overview of data engineering, covering key concepts, roles, and technologies. It explores the relationship between data engineering and big data, highlighting the importance of data pipelines and etl processes. The document also delves into the responsibilities of data engineers and their role in enabling data scientists. It is a valuable resource for anyone interested in understanding the fundamentals of data engineering and its applications in the context of big data.
Typology: Summaries
1 / 79
This page cannot be seen from the preview
Don't miss anything!








































































U N D E R S TA N D I N G D ATA E N G I N E E R I N G Hadrien Lacroix Content Developer at DataCamp
Conceptual course No coding involved Objectives Being able to exchange with data engineers Provide a solid foundation to learn more
How data storage works
How to move and process data
Ingest data from different sources Optimize databases for analysis Remove corrupted data Develop, construct, test and maintain data architectures
Big data becomes the norm =>
Sensors and devices Social media Enterprise data VoIP (voice communication, multimedia sessions) Data Age 2025, Seagate, November 2018 1
Volume (how much?) Variety (what kind?) Velocity (how frequent?) Veracity (how accurate?) Value (how useful?)
U N D E R S TA N D I N G D ATA E N G I N E E R I N G Hadrien Lacroix Content Developer at DataCamp