
Efeslab

Efeslab at the University of Washington

Popular repositories

  1. Nanoflow Public

    A throughput-oriented high-performance serving framework for LLMs

    Cuda · 779 stars · 31 forks

  2. Atom Public

    [MLSys'24] Atom: Low-bit Quantization for Efficient and Accurate LLM Serving

    Cuda · 299 stars · 27 forks

  3. fiddler Public

    [ICLR'25] Fast Inference of MoE Models with CPU-GPU Orchestration

    Python · 201 stars · 18 forks

  4. LiteASR Public

    LiteASR: Efficient Automatic Speech Recognition with Low-Rank Approximation

    Python · 88 stars · 2 forks

  5. lapidary Public archive

    Creating beautiful gem5 simulations

    C++ · 47 stars · 13 forks

  6. DMon-AE Public

    DMon Prototype for OSDI 2021 Artifact Evaluation

    C++ · 22 stars · 1 fork

Repositories

Showing 10 of 99 repositories
  • Pathfinder Public

    Scalable and accurate crash-consistency testing tool for POSIX-based and MMIO-based applications.

    C++ · 4 stars · 0 forks · 0 issues · 0 pull requests · Updated Mar 23, 2025
  • LiteASR Public

    LiteASR: Efficient Automatic Speech Recognition with Low-Rank Approximation

    Python · 88 stars · Apache-2.0 · 2 forks · 2 issues · 0 pull requests · Updated Mar 5, 2025
  • vllm Public Forked from vllm-project/vllm

    A high-throughput and memory-efficient inference and serving engine for LLMs

    Python · 0 stars · Apache-2.0 · 6,559 forks · 0 issues · 0 pull requests · Updated Jan 24, 2025
  • genmc Public Forked from MPI-SWS/genmc

    Generic model checker for concurrent C programs (mirror repository)

    C++ · 0 stars · GPL-3.0 · 21 forks · 0 issues · 0 pull requests · Updated Jan 19, 2025
  • wiredtiger Public Forked from wiredtiger/wiredtiger

    WiredTiger's source tree

    C · 0 stars · 398 forks · 0 issues · 0 pull requests · Updated Jan 8, 2025
  • rocksdb-squint Public Forked from facebook/rocksdb

    A library that provides an embeddable, persistent key-value store for fast storage.

    C++ · 0 stars · GPL-2.0 · 6,582 forks · 0 issues · 0 pull requests · Updated Jan 7, 2025
  • alice Public Forked from madthanu/alice
    C · 0 stars · 26 forks · 0 issues · 0 pull requests · Updated Jan 6, 2025
  • leveldb Public Forked from google/leveldb

    LevelDB is a fast key-value storage library written at Google that provides an ordered mapping from string keys to string values.

    C++ · 0 stars · BSD-3-Clause · 8,146 forks · 0 issues · 0 pull requests · Updated Dec 28, 2024
  • C++ · 0 stars · Apache-2.0 · 0 forks · 0 issues · 0 pull requests · Updated Nov 30, 2024
  • fiddler Public

    [ICLR'25] Fast Inference of MoE Models with CPU-GPU Orchestration

    Python · 201 stars · Apache-2.0 · 18 forks · 2 issues · 0 pull requests · Updated Nov 18, 2024

People

This organization has no public members. You must be a member to see who’s a part of this organization.