Skip to content
@IST-DASLab

IST Austria Distributed Algorithms and Systems Lab

Popular repositories Loading

  1. gptq gptq Public

    Code for the ICLR 2023 paper "GPTQ: Accurate Post-training Quantization of Generative Pretrained Transformers".

    Python 2.3k 194

  2. marlin marlin Public

    FP16xINT4 LLM inference kernel that can achieve near-ideal ~4x speedups up to medium batchsizes of 16-32 tokens.

    Python 1k 86

  3. sparsegpt sparsegpt Public

    Code for the ICML 2023 paper "SparseGPT: Massive Language Models Can Be Accurately Pruned in One-Shot".

    Python 870 118

  4. PanzaMail PanzaMail Public

    Python 299 21

  5. qmoe qmoe Public

    Code for the paper "QMoE: Practical Sub-1-Bit Compression of Trillion-Parameter Models".

    Python 280 24

  6. llmq llmq Public

    Quantized LLM training in pure CUDA/C++.

    C++ 241 14

Repositories

Showing 10 of 81 repositories
  • ant Public Forked from gvlassis/ant

    🐜 Research-friendly Deep Learning framework

    IST-DASLab/ant’s past year of commit activity
    Python 0 MIT 1 0 0 Updated Mar 2, 2026
  • Quartet-II Public

    Quartet II Official Code

    IST-DASLab/Quartet-II’s past year of commit activity
    Python 51 4 0 1 Updated Mar 1, 2026
  • llmq Public

    Quantized LLM training in pure CUDA/C++.

    IST-DASLab/llmq’s past year of commit activity
    C++ 241 Apache-2.0 14 0 0 Updated Mar 1, 2026
  • FP-Quant Public
    IST-DASLab/FP-Quant’s past year of commit activity
    Python 98 17 11 3 Updated Feb 26, 2026
  • DASH Public

    Code for DASH, a faster alternative to Distributed Shampoo via block stacking.

    IST-DASLab/DASH’s past year of commit activity
    Python 8 MIT 0 1 0 Updated Feb 18, 2026
  • GPTQ-Babai Public

    Official Repository for "The Geometry of LLM Quantization: GPTQ as Babai's Nearest Plane Algorithm" (ICLR 2026)

    IST-DASLab/GPTQ-Babai’s past year of commit activity
    2 MIT 0 0 0 Updated Feb 18, 2026
  • MatGPTQ Public

    Code for MatGPTQ: Accurate and Efficient Post-Training Matryoshka Quantization

    IST-DASLab/MatGPTQ’s past year of commit activity
    Python 10 MIT 1 0 0 Updated Feb 18, 2026
  • IST-DASLab/ISTA-DASLab-Optimizers’s past year of commit activity
    Python 13 MIT 0 0 0 Updated Feb 18, 2026
  • WUSH Public

    Official Repository for "WUSH: Near-Optimal Adaptive Transforms for LLM Quantization"

    IST-DASLab/WUSH’s past year of commit activity
    Python 2 Apache-2.0 0 0 0 Updated Feb 16, 2026
  • PanzaMail Public
    IST-DASLab/PanzaMail’s past year of commit activity
    Python 299 Apache-2.0 21 4 7 Updated Feb 15, 2026

Most used topics

Loading…