Back to Search
DataExpert-io

DataExpert-io/data-engineer-handbook

This is a repo with links to everything you'd ever want to learn about data engineering

39,434stars
7,570forks
39,434watchers
Updated 1/21/2026
apachesparkawesomebigdatadatadataengineeringsql

README.md

The Data Engineering Handbook

DataExpert-io%2Fdata-engineer-handbook | Trendshift

This repo has all the resources you need to become an amazing data engineer!

Getting started

If you are new to data engineering, start by following this 2024 breaking into data engineering roadmap

If you are here for the 4-week free beginner boot camp you can check out:

If you are here for the 6-week free intermediate boot camp you can check out

For more applied learning:

  • Check out the projects section for more hands-on examples!
  • Check out the interviews section for more advice on how to pass data engineering interviews!
  • Check out the books section for a list of high quality data engineering books
  • Check out the communities section for a list of high quality data engineering communities to join
  • Check out the newsletter section to learn via email

Resources

Great list of over 25 books

Top 3 must read books are:

Great list of over 10 communities to join:

Top must-join communities for DE:

Top must-join communities for ML:

Companies:

Data Engineering blogs of companies:

Data Engineering Whitepapers:

Social Media Accounts

Here's the mostly comprehensive list of data engineering creators: (You have to have at least 5k followers somewhere to be added!)

YouTube

NameYouTube ChannelFollower Count
ByteByteGoByteByteGo1,000,000+
Data with BaraaData with Baraa195,000+
Zach WilsonData with Zach150,000+
Shashank MishraE-learning Bridge100,000+
Seattle Data GuySeattle Data Guy100,000+
TrendyTechTrendyTech100,000+
Darshil ParmarDarshil Parmar100,000+
Andreas KretzAndreas Kretz100,000+
The Ravit ShowThe Ravit Show100,000+
Guy in a CubeGuy in a Cube100,000+
Adam MarczakAdam Marczak100,000+
nullQueriesnullQueries100,000+
TECHTFQ by ThoufiqTECHTFQ by Thoufiq100,000+
SQLBISQLBI100,000+
Alex FrebergAlex The Analyst100,000+
Ankur RanjanBig Data Show100,000+
Prashanth Kumar PandeyScholarNest77,000+
ITVersityITVersity67,000+
Soumil ShahSoumil Shah50,000
Ansh LambaAnsh Lamba18,000+
Azure LibAzure Lib10,000+
Advancing AnalyticsAdvancing Analytics10,000+
Kahan Data SolutionsKahan Data Solutions10,000+
Ankit BansalAnkit Bansal10,000+
Mr. K Talks TechMr. K Talks Tech10,000+
Samuel FochtPython Basics10,000+
Mehdi OuazzaMehdio DataTV3,000+
Alex MercedAlex Merced DataN/A
John KutayJohn KutayN/A
Emil KaminskiDatabricks For Professionals5,000+

LinkedIn

NameLinkedIn ProfileFollower Count
Zach WilsonZach Wilson400,000+
Chip HuyenChip Huyen250,000+
Shashank MishraShashank Mishra100,000+
Seattle Data GuyBen Rogojan100,000+
TrendyTechSumit Mittal100,000+
Darshil ParmarDarshil Parmar100,000+
Andreas KretzAndreas Kretz100,000+
ByteByteGo (Alex Xu)Alex Xu100,000+
Azure Lib (Deepak Goyal)Deepak Goyal100,000+
Alex FrebergAlex Freberg100,000+
SQLBI (Marco Russo)Marco Russo50,000+
Ankit BansalAnkit Bansal50,000+
Marc LambertiMarc Lamberti50,000+
Ankur RanjanAnkur Ranjan48,000+
ITVersity (Durga Gadiraju)Durga Gadiraju48,000+
Prashanth Kumar PandeyPrashanth Kumar Pandey37,000+
Alex MercedAlex Merced30,000+
Ijaz AliIjaz Ali24,000+
Mehdi OuazzaMehdi Ouazza20,000+
Ananth PackkilduraiAnanth Packkildurai18,000+
Ansh LambaAnsh Lamba13,000+
Manojkumar VadivelManojkumar Vadivel12,000+
Advancing AnalyticsSimon Whiteley10,000+
Li YinLi Yin10,000+
Jaco van GelderJaco van Gelder10,000+
Joseph MachadoJoseph Machado10,000+
Eric RobyEric Roby10,000+
Simon SpätiSimon Späti10,000+
Constantin LunguConstantin Lungu10,000+
Lakshmi SontenamLakshmi Sontenam9,500+
Dani PálmaDaniel Pálma9,000+
Soumil ShahSoumil Shah8,000+
Arnaud MillekerArnaud Milleker7,000+
Dimitri VisnadiDimitri Visnadi7,000+
LennyLenny A6,000+
Dipankar MazumdarDipankar Mazumdar5,000+
Daniel CiocirlanDaniel Ciocirlan5,000+
Hugo LuHugo Lu5,000+
Tobias MaceyTobias Macey5,000+
Marcos OrtizMarcos Ortiz5,000+
Julien HuraultJulien Hurault5,000+
John KutayJohn Kutay5,000+
Hassaan AkbarHassaan Akbar5,000+
SubhankarSubhankar5,000+
NitinNitinN/A
HassaanHassaan5000+
Javier de la TorreJavier5000+

X/Twitter

NameX/Twitter ProfileFollower Count
ByteByteGoalexxubyte100,000+
Dan Kornas@dankornas66,000+
Zach WilsonEcZachly30,000+
Seattle Data GuySeattleDataGuy10,000+
SQLBImarcorus10,000+
Joseph Machadostartdataeng5,000+
Alex Merced@amdatalakehouseN/A
John Kutay@JohnKutayN/A
Mehdi Ouazzamehd_ioN/A

Instagram

NameInstagram ProfileFollower Count
Sundas Khalidsundaskhalidd300,000+
Zach Wilsoneczachly150,000+
Andreas Kretzlearndataengineering5,000+
Alex Merced@alexmercedcoderN/A

TikTok

NameTikTok ProfileFollower Count
Zach Wilson@eczachly70,000+
Alex Freberg@alex_the_analyst10,000+
Mehdi Ouazza@mehdio_datatvN/A

Great Podcasts

Great list of 20+ newsletters

Top must follow newsletters for data engineering:

Glossaries:

Design Patterns

Courses / Academies

Certifications Courses