Skip to content
View kssteven418's full-sized avatar

Block or report kssteven418

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Pinned Loading

  1. SqueezeAILab/LLMCompiler SqueezeAILab/LLMCompiler Public

    [ICML 2024] LLMCompiler: An LLM Compiler for Parallel Function Calling

    Python 1.8k 122

  2. SqueezeAILab/SqueezeLLM SqueezeAILab/SqueezeLLM Public

    [ICML 2024] SqueezeLLM: Dense-and-Sparse Quantization

    Python 704 49

  3. Squeezeformer Squeezeformer Public

    [NeurIPS'22] Squeezeformer: An Efficient Transformer for Automatic Speech Recognition

    Python 261 19

  4. I-BERT I-BERT Public

    [ICML'21 Oral] I-BERT: Integer-only BERT Quantization

    Python 258 42

  5. LTP LTP Public

    [KDD'22] Learned Token Pruning for Transformers

    Python 100 19

  6. BigLittleDecoder BigLittleDecoder Public

    [NeurIPS'23] Speculative Decoding with Big Little Decoder

    Python 94 11