A simplified, lightweight ETL Framework based on Apache Spark
Updated Jan 24, 2024 (Scala)
Big Data RDF Processing and Analytics Stack built on Apache Spark and Apache Jena http://sansa-stack.github.io/SANSA-Stack/
Workflow engine for exploration of simulation models using high throughput computing
A purely functional library to build distributed and event-driven systems
Distributed Linear Programming Solver on top of Apache Spark
सूचि - Toolkit to build Distributed Data Systems
SANSA Machine Learning Layer
A Scala DSL to write type-safe programs for distributed computing
SANSA Query Layer
A general Inference API based on two of the most popular Big Data processing engines: Apache Spark and Apache Flink
SANSA Stack OWL (Web Ontology Language) API
SmartFD: Efficient and Scalable Functional Dependency Discovery on Distributed Data-Parallel Platforms
Learn distributed systems in Scala using ZIO and Maelstrom
DGST: Efficient and Scalable Generalized Suffix Tree Construction on Apache Spark
Scalable record-level matching rules
Two-Phase Commit on akka-actors
Financial Forecasting and its correlation with Human Sentiments using Distributed Computing on Spark Framework
A distributed chess engine. At a time when everyone is turning to deep learning, this engine attempts to leverage the power of distributed, concurrent computing.
Scala version of Manikin
A distributed graph computing platform that enables simple visual analysis of large-scale relational data.