Skip to content
View erjan's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report erjan

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Yet Another Testing Language. Less code. More tests. Lets you describe API tests in clean YAML.

Python 23 Updated Apr 28, 2026

📡 Real-time data pipeline with Kafka, Flink, Iceberg, Trino, MinIO, and Superset. Ideal for learning data systems.

Python 68 15 Updated Jan 18, 2025

Notes talking about the design and implementation of Apache Spark

5,368 1,830 Updated Apr 2, 2024

Roadmap для Data Engineer. Цель роадмапа – устроиться тебе на работу!

489 179 Updated Mar 30, 2026

This repository will contain all of the resources for the Mage component of the Data Engineering Zoomcamp: https://github.com/DataTalksClub/data-engineering-zoomcamp/tree/main

Dockerfile 104 116 Updated Aug 20, 2024

My Insight Data Engineering Fellowship project. I implemented a big data processing pipeline based on ​lambda architecture​, that aggregates Twitter and US stock market data for user sentiment anal…

Scala 506 130 Updated Aug 24, 2022

100+ Python challenging programming exercises

29,015 6,950 Updated Apr 6, 2026

Practice your pandas skills!

Jupyter Notebook 12,645 8,948 Updated Oct 17, 2025

An example project that demontrates real time big data stream processing using GigaSpaces

Java 19 9 Updated Feb 26, 2022

100 numpy exercises (with solutions)

Python 14,066 6,684 Updated Apr 29, 2026

Data Engineering pet-project covering GCP, Docker, workflow orchestration with Mage, data transforming with dbt, batch processing via Spark

Python 1 Updated Apr 21, 2024

Pipeline that extracts data from Crinacle's Headphone and InEarMonitor databases and finalizes data for a Metabase Dashboard. The dashboard is then used to support a purchasing decision of which He…

Python 268 60 Updated Jan 1, 2023

The smart city reference pipeline shows how to integrate various media building blocks, with analytics powered by the OpenVINO™ Toolkit, for traffic or stadium sensing, analytics and management tasks.

Python 218 90 Updated May 5, 2025

Terminal User Interface (TUI) apps

Python 983 66 Updated Feb 2, 2026

This project shows how to capture changes from postgres database and stream them into kafka

Python 42 21 Updated May 17, 2024

Learn how to design, develop, deploy and iterate on production-grade ML applications.

Jupyter Notebook 3,348 594 Updated Aug 16, 2024
Jupyter Notebook 18 30 Updated Aug 30, 2019

My solution to the book <A collection of Data Science Take-home Challenges>

Jupyter Notebook 990 525 Updated Oct 31, 2022
JavaScript 1 Updated Jun 15, 2023

Sample project to demonstrate data engineering best practices

Python 216 43 Updated Feb 24, 2024

DataTalks.Club's Data Engineering Zoomcamp Project

Python 11 3 Updated May 7, 2023

Final Project of the MLOps Zoomcamp hosted by DataTalksClub.

HTML 25 5 Updated Dec 19, 2022

DataTalks.Club's Data Engineering Zoomcamp Project

Python 24 7 Updated Jul 14, 2022

A repo to track data engineering projects

Jupyter Notebook 13 6 Updated Nov 11, 2022

A batch Data Pipeline that retrieves data from a user purchase table and a movie review table and is transformed to form a user behaviour metric table.

HCL 18 3 Updated Aug 14, 2025

A project portfolio to accompany my resume

Python 30 5 Updated Sep 5, 2023

Insight Data Engineering Project

Python 15 10 Updated Jun 1, 2021

Data Engineering Project in GCP

Python 22 4 Updated Mar 29, 2023

In this project, we will build and ETL(Extract,Transform,Load) pipeline using the Spotify API on AWS. The pipeline will retrieve data from the Spotify API, transform into desired format and load it…

Jupyter Notebook 25 4 Updated May 6, 2023
Next