Tuesday, February 15, 2022

Data Engineering Roadmap 2022 for Beginners

1. Programming Languages

2. Learn Linux


Linux Essentials - Beginner Crash Course (Ubuntu) https://www.youtube.com/watch?v=n_2jPbQornY


3. Learn about Data Structures and Algorithms

DATA STRUCTURES you MUST know https://www.youtube.com/watch?v=sVxBVvlnJsM


4. Learn about Core DBMS (Database Management Systems)

Learn RDBMS in 6 minutes https://www.youtube.com/watch?v=t48TGntrX4s

5. Learn SQL

SQL Tutorial - Full Database Course for Beginners https://www.youtube.com/watch?v=HXV3zeQKqGY


6. Data Exploration Libraries (Pandas — NumPy — Spark)


7. Data Warehousing and Data Lake Concepts




8. Learn about Distributed Computing and Cloud Computing 

Cloud Computing Tutorial for Beginners https://www.youtube.com/watch?v=RWgW-CgdIk0
Distributed Systems | Distributed Computing Explained https://www.youtube.com/watch?v=ajjOEltiZm4

9. Workflow schedulers


Apache Airflow for beginners https://www.youtube.com/watch?v=YWtfU0MQZ_4


10. NoSQL Databases


11. Streaming Systems

Kafka Streams 101: Getting Started https://www.youtube.com/watch?v=y9a3fldlvnI

12. Dashboarding tools



14. Data Engineering in the Cloud

AWS Data Engineering Course - Full Course https://www.youtube.com/watch?v=ckQ7d6ca2J0
Google Cloud Platform Full Course https://www.youtube.com/watch?v=IUU6OR8yHCc

15. DevOps (Docker — Kubernetes)


Docker and Kubernetes Tutorial | Full Course [2021] https://www.youtube.com/watch?v=bhBSlnQcq2k

16. System Design

System Design Course for Beginners https://www.youtube.com/watch?v=MbjObHmDbZo
System Design Interview – Step By Step Guide https://www.youtube.com/watch?v=bUHFg8CZFws
System Design Mock Interview: Design Instagram https://www.youtube.com/watch?v=VJpfO6KdyWE

This series covers key areas of system design that come up both when designing real-world systems and in interview questions.

  1. Load balancing
  2. Message Passing
  3. Microservice architecture
  4. NoSQL databases
  5. Distributed Systems

https://www.youtube.com/playlist?list=PLMCXHnjXnTnvo6alSjVkgxV-VH6EPyvoX

Wednesday, February 2, 2022

[Video Lecture] Statistical Machine Learning and the Science of Big Data Analytics

 


Public lecture “Statistical Machine Learning and the Science of Big Data Analytics”, Part 1


Public lecture “Statistical Machine Learning and the Science of Big Data Analytics”, Part 2

Thursday, January 6, 2022

5 Machine Learning BEGINNER Projects (+ Datasets & Solutions)

In this tutorial I share 5 beginner machine learning projects with you and give you tips on how to solve all of them. These projects are for complete beginners and should teach you some basic machine learning concepts. The difficulty increases a little with each project, and each one introduces a new algorithm.


For each project we give you an algorithm that you can use. The links to the datasets can be found below.

Project 1:

Project 2:

Project 3:

Project 4:

Project 5:

 
