Data-Centric Pipelines and Data Versioning
Updated Feb 3, 2025 - Go
A crazy fast analytical database, built on bitmaps. Perfect for ML applications. Learn more at: http://docs.featurebase.com/. Start a Docker instance: https://hub.docker.com/r/featurebasedb/featurebase
A search engine which can hold 100 trillion lines of log data.
Fluid: elastic data abstraction and acceleration for big data/AI applications in the cloud. (A CNCF project.)
TalariaDB is a distributed, highly available, low-latency time-series database for Presto.
Distributed, Versioned, Image-oriented Dataservice
Hazelcast Go Client
DevLake: the open-source dev data platform and dashboard for your DevOps tools. *Note*: We have moved to the Apache Software Foundation: https://github.com/apache/incubator-devlake.
An extremely fast, non-cryptographically-secure, AES-based hash algorithm for big data.
rtdl makes it easy to build and maintain a real-time data lake.
Flux is a powerful tool designed to monitor proxy providers across the industry, analyzing response times, uptime, outgoing IPs, and more. With Flux, you can uncover the true performance and integrity of proxy providers, ensuring you're working with reliable data and not falling for misleading claims.
An admission control policy that safeguards against accidental duplicate claiming of Hosts/Domains.