Skip to content
Change the repository type filter

All

    Repositories list

    • DALI

      Public
      A GPU-accelerated library containing highly optimized building blocks and an execution engine for data processing to accelerate deep learning training and inference applications.
      C++
      6585.6k22533Updated Jan 26, 2026Jan 26, 2026
    • Megatron-Energon

      Public
      Megatron's multi-modal data loader
      Python
      38310144Updated Jan 26, 2026Jan 26, 2026
    • cuEquivariance

      Public
      cuEquivariance is a math library that is a collective of low-level primitives and tensor ops to accelerate widely-used models, like DiffDock, MACE, Allegro and NEQUIP, based on equivariant neural networks. Also includes kernels for accelerated structure prediction.
      Python
      24349133Updated Jan 26, 2026Jan 26, 2026
    • cccl

      Public
      CUDA Core Compute Libraries
      C++
      3262.1k1.2k207Updated Jan 26, 2026Jan 26, 2026
    • cuda-quantum

      Public
      C++ and Python support for the CUDA Quantum programming model for heterogeneous quantum-classical workflows
      C++
      32489742387Updated Jan 26, 2026Jan 26, 2026
    • Megatron-LM

      Public
      Ongoing research training transformer models at scale
      Python
      3.5k15k306273Updated Jan 26, 2026Jan 26, 2026
    • makani

      Public
      Massively parallel training of machine-learning based weather and climate models
      Python
      6335144Updated Jan 26, 2026Jan 26, 2026
    • TensorRT-LLM

      Public
      TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT LLM also contains components to create Python and C++ runtimes that orchestrate the inference execution in a performant way.
      Python
      2k13k519469Updated Jan 26, 2026Jan 26, 2026
    • NVSentinel

      Public
      NVSentinel is a cross-platform fault remediation service designed to rapidly remediate runtime node-level issues in GPU-accelerated computing environments
      Go
      371653315Updated Jan 26, 2026Jan 26, 2026
    • doca-platform

      Public
      DOCA Platform manages provisioning and service orchestration for Bluefield DPUs
      Go
      187500Updated Jan 26, 2026Jan 26, 2026
    • spark-rapids

      Public
      Spark RAPIDS plugin - accelerate Apache Spark with GPUs
      Scala
      2719591.8k36Updated Jan 26, 2026Jan 26, 2026
    • k8s-device-plugin

      Public
      NVIDIA device plugin for Kubernetes
      Go
      7803.6k7239Updated Jan 26, 2026Jan 26, 2026
    • OSMO

      Public
      The developer-first platform for scaling complex Physical AI workloads across heterogeneous compute—unifying training GPUs, simulation clusters, and edge devices in a simple YAML
      Python
      6824212Updated Jan 26, 2026Jan 26, 2026
    • gpu-operator

      Public
      NVIDIA GPU Operator creates, configures, and manages GPUs in Kubernetes
      Go
      4422.5k9165Updated Jan 26, 2026Jan 26, 2026
    • bionemo-framework

      Public
      BioNeMo Framework: For building and adapting AI models in drug discovery at scale
      Jupyter Notebook
      11564161116Updated Jan 26, 2026Jan 26, 2026
    • cuopt

      Public
      GPU accelerated decision optimization
      Cuda
      1166808418Updated Jan 26, 2026Jan 26, 2026
    • TileGym

      Public
      Helpful kernel tutorials and examples for tile-based GPU programming
      Python
      3660921Updated Jan 26, 2026Jan 26, 2026
    • sandbox-device-plugin

      Public
      Kubernetes Device Plugin to help cold plug vfio/iommufd GPUs in Kata VMs for Confidential Containers
      Go
      3217Updated Jan 26, 2026Jan 26, 2026
    • stdexec

      Public
      `std::execution`, the proposed C++ framework for asynchronous and parallel programming.
      C++
      2252.2k12215Updated Jan 26, 2026Jan 26, 2026
    • Model-Optimizer

      Public
      A unified library of SOTA model optimization techniques like quantization, pruning, distillation, speculative decoding, etc. It compresses deep learning models for downstream deployment frameworks like TensorRT-LLM, TensorRT, vLLM, etc. to optimize inference speed.
      Python
      2421.9k6470Updated Jan 26, 2026Jan 26, 2026
    • NeMo-Agent-Toolkit

      Public
      The NVIDIA NeMo Agent toolkit is an open-source library for efficiently connecting and optimizing teams of AI agents.
      Python
      4941.8k6628Updated Jan 26, 2026Jan 26, 2026
    • nvidia-resiliency-ext

      Public
      NVIDIA Resiliency Extension is a python package for framework developers and users to implement fault-tolerant features. It improves the effective training time by minimizing the downtime due to failures and interruptions.
      Python
      42253219Updated Jan 26, 2026Jan 26, 2026
    • recsys-examples

      Public
      Examples for Recommenders - easy to train and deploy on accelerated infrastructure.
      Python
      432114311Updated Jan 26, 2026Jan 26, 2026
    • Fuser

      Public
      A Fusion Code Generator for NVIDIA GPUs (commonly known as "nvFuser")
      C++
      75375211203Updated Jan 26, 2026Jan 26, 2026
    • spark-rapids-examples

      Public
      A repo for all spark examples using Rapids Accelerator including ETL, ML/DL, etc.
      Jupyter Notebook
      62166213Updated Jan 26, 2026Jan 26, 2026
    • accelerated-computing-hub

      Public
      NVIDIA curated collection of educational resources related to general purpose GPU programming.
      Jupyter Notebook
      1971.1k144Updated Jan 26, 2026Jan 26, 2026
    • NV-Kernels

      Public
      Ubuntu kernels which are optimized for NVIDIA server systems
      C
      5387014Updated Jan 26, 2026Jan 26, 2026
    • aistore

      Public
      AIStore: scalable storage for AI applications
      Go
      2321.7k20Updated Jan 26, 2026Jan 26, 2026
    • k8s-dra-driver-gpu

      Public
      NVIDIA DRA Driver for GPUs
      Go
      1135518923Updated Jan 25, 2026Jan 25, 2026
    • KAI-Scheduler

      Public
      KAI Scheduler is an open source Kubernetes Native scheduler for AI workloads at large scale
      Go
      1381.1k3161Updated Jan 25, 2026Jan 25, 2026