Pull requests: vllm-project/llm-compressor
Pull requests list
      [AWQ] allow for use of model-wide kwargs cache
      
    
      
  
        
          #1985
            opened Oct 30, 2025  by
            brian-dellabetta
            
        
        
            
    •
    
      Draft
    
  
        
        
      
    
      [MoE] Clean up imports, add qwen3_moe_vl, change logger level
        
              
                ready
  When a PR is ready for review 
        
      
    
      
  
        
          #1981
            opened Oct 29, 2025  by
            kylesayrs
            
        
        
            
    
  
 
        
        
      
    
      [AWQ] Allow users to disable quantization during AWQ
      
    
      
  
        
          #1973
            opened Oct 28, 2025  by
            brian-dellabetta
            
        
        
            
    •
    
      Draft
    
  
        
        
      
    
      Modernize entrypoints module with type hints and use generic types
        
              
                ready
  When a PR is ready for review 
        
      
    
      
  
        
          #1965
            opened Oct 25, 2025  by
            sugatmahanti
            
        
        
            
    
  
 
        
        
      
    
      Fixing untie to be used only as needed and automatic
        
              
                ready
  When a PR is ready for review 
        
      
    
      
  
        
          #1963
            opened Oct 24, 2025  by
            HDCharles
            
        
        
            
    
  
 
        
        
      
    
      [Oneshot] Add validation for empty dataset and enhance oneshot function parameters
      
    
      
  
        
          #1957
            opened Oct 21, 2025  by
            ArkaSanka
            
        
        
            
    
  
 
        
        
      
    
      [Attention] Support FP4 attention quantization
        
              
                nvfp4
  For any PR / issue related to NVFP4 support 
        
      
    
      
  
        
          #1924
            opened Oct 14, 2025  by
            kylesayrs
            
        
        
            
    
  
 
        
        
      
    
      [Training] Fix tokenizer attribute of SessionMixin
        
              
                ready
  When a PR is ready for review 
        
        
          #1895
            opened Oct 1, 2025  by
            kylesayrs
            
        
        
            
    
  
 
        
        
      
    
      [Dependencies] update lm_eval version pin
        
              
                ready
  When a PR is ready for review 
        
        
          #1862
            opened Sep 24, 2025  by
            brian-dellabetta
            
        
        
            
    
  
 
        
        
      
    
      [Logging] clean up CompressionLogger verbosity
        
              
                ready
  When a PR is ready for review 
        
      
    
        
          #1861
            opened Sep 23, 2025  by
            brian-dellabetta
            
        
        
            
    
  
 
        
        
      
    
      Updating base.py (parallel calibration and model #1809)
      
    
      
  
        
          #1837
            opened Sep 17, 2025  by
            aashvgit
            
        
        
            
    
  
 
        
        
      
  
  