@FastLM

FastLM

We develop fast, lightweight LMs for large-scale, distributed, parallel, and sparse scenarios.

Popular repositories

  1. tinyserve-vllm Public

    [ACM MM 2025 Oral] TinyServe: Query-Aware Page Allocation Optimization

    Python 9 2

  2. CSV-Decode Public

    CSV-Decode: Certifiable Sub-Vocabulary Decoding for Efficient Large Language Model Inference

    Python 7

  3. FastCache Public

    Forked from NoakLiu/FastCache-xDiT

    FastCache: Fast Caching for Diffusion Transformer Through Learnable Linear Approximation [Efficient ML Model]

    Python 6

  4. HSGM Public

    [ICPADS 2025 Oral, *SEM 2025 Oral] HSGM: Hierarchical Segment-Graph Memory for Scalable Long-Text Semantics

    Python 6

  5. SPI_VecDB Public

    [ICPADS 2025 Oral] Distributed Parallel Multi-Resolution Vector Search

    Go 6

  6. PiKV Public

    Forked from NoakLiu/PiKV

    PiKV: KV Cache Management System for MoE [Efficient ML System]

    Python 5

Repositories

Showing 10 of 12 repositories

Top languages

Python C++ Go
