A flexible and efficient implementation of Flash Attention 2.0 for JAX, supporting multiple backends (GPU/TPU/CPU) and platforms (Triton/Pallas/JAX).
-
Updated
Mar 4, 2025 - Python
A flexible and efficient implementation of Flash Attention 2.0 for JAX, supporting multiple backends (GPU/TPU/CPU) and platforms (Triton/Pallas/JAX).
Calculate the hash of any input for ZK-Friendly hashes (MiMC & Poseidon) over a variety of Elliptic Curves.
A library used to access the Pallas service
Benchmarking the JAX Pallas implementation of a custom RNN against alternatives
Repo to hold core components when building a Pallas Systems Website
Add a description, image, and links to the pallas topic page so that developers can more easily learn about it.
To associate your repository with the pallas topic, visit your repo's landing page and select "manage topics."