Skip to content
View abodacs's full-sized avatar

Block or report abodacs

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Stars

data-engineering

64 repositories

The best place to learn data engineering. Built and maintained by the data engineering community.

CSS 1,616 183 Updated Mar 22, 2025

Classwork projects and home works done through Udacity data engineering nano degree

Jupyter Notebook 74 72 Updated Dec 12, 2023

More than 2000+ Data engineer interview questions.

1,295 461 Updated Jan 26, 2025

Free Introduction to Bash Scripting eBook

HTML 4,647 486 Updated Jan 21, 2025

An orchestration platform for the development, production, and observation of data assets.

Python 12,804 1,631 Updated Mar 27, 2025

Microsoft REST API Guidelines

22,942 2,718 Updated Feb 19, 2025

A model set of guidelines for RESTful APIs and Events, created by Zalando

CSS 2,898 411 Updated Mar 3, 2025

A collection of docker-compose files

HTML 364 47 Updated Feb 27, 2025

Simple, open source, lightweight (< 1 KB) and privacy-friendly web analytics alternative to Google Analytics.

Elixir 21,889 1,168 Updated Mar 27, 2025

Database for AI. Store Vectors, Images, Texts, Videos, etc. Use with LLMs/LangChain. Store, query, version, & visualize any AI data. Stream data in real-time to PyTorch/TensorFlow. https://activelo…

Python 8,497 655 Updated Mar 11, 2025

Edge Inference in Browser with Transformer NLP model

Jupyter Notebook 310 56 Updated Sep 27, 2022

Site Reliability Engineer Interview Preparation Guide

7,748 2,004 Updated Mar 8, 2025

Let ChatGPT teach your own chatbot in hours with a single GPU!

Python 3,169 290 Updated Mar 17, 2024

Jitsu is an open-source Segment alternative. Fully-scriptable data ingestion engine for modern data teams. Set-up a real-time data pipeline in minutes, not days

TypeScript 4,250 311 Updated Mar 27, 2025

Modern columnar data format for ML and LLMs implemented in Rust. Convert from parquet in 2 lines of code for 100x faster random access, vector index, and data versioning. Compatible with Pandas, Du…

Rust 4,296 266 Updated Mar 27, 2025

This is my video documentation. Here you'll find code-snippets, technical documentation, templates, command reference, and whatever is needed for all my YouTube Videos.

Python 938 338 Updated Jun 1, 2024
TypeScript 59 1 Updated Jun 18, 2023

Tomato Architecture - A common sense driven approach to software architecture

674 35 Updated Feb 2, 2024

A collection of debugging stories. PRs welcome (sorry for the backlog) :-)

3,790 144 Updated May 29, 2024

An organized learning path on Clean Code, Test-Driven Development, Legacy Code, Refactoring, Domain-Driven Design and Microservice Architecture

2,924 352 Updated Jan 19, 2022

A Data Lakehouse Proof-of-Concept for streaming CDC events from a database into cloud storage using Delta Lake

Scala 6 Updated Aug 22, 2023

Hamilton helps data scientists and engineers define testable, modular, self-documenting dataflows, that encode lineage/tracing and metadata. Runs and scales everywhere python does.

Jupyter Notebook 2,067 139 Updated Mar 21, 2025

Bootstrap Kubernetes the hard way on Vagrant on Local Machine. No scripts.

Shell 4,882 4,693 Updated Sep 8, 2024

Checklist of the most important security countermeasures when designing, testing, and releasing your API

22,736 2,630 Updated Nov 22, 2024

Platform Engineering on Kubernetes :: Book Examples

HTML 270 151 Updated Sep 24, 2024

A Simple Bulk Labelling Tool

Python 571 48 Updated Dec 29, 2024

Explain complex systems using visuals and simple terms. Help you prepare for system design interviews.

71,236 7,533 Updated Aug 16, 2024

The Startup CTO's Handbook, a book covering leadership, management and technical topics for leaders of software engineering teams

13,346 727 Updated Mar 19, 2025

My chapter-wise notes for Database Internals by Alex Petrov.

420 51 Updated Mar 21, 2024

This is a repo with links to everything you'd ever want to learn about data engineering

Jupyter Notebook 27,204 5,560 Updated Feb 22, 2025