Skip to content
Change the repository type filter

All

    Repositories list

    • tiny-tpu

      Public
      A minimal tensor processing unit (TPU), inspired by Google's TPU V2 and V1
      SystemVerilog
      107000Updated Aug 18, 2025Aug 18, 2025
    • Simplify your onnx model
      C++
      Apache License 2.0
      430100Updated Jul 8, 2024Jul 8, 2024
    • tvm-vta

      Public
      Open, Modular, Deep Learning Accelerator
      Scala
      Apache License 2.0
      91000Updated Apr 10, 2024Apr 10, 2024
    • fpga-npu

      Public
      SystemVerilog
      64100Updated Apr 8, 2024Apr 8, 2024
    • Notebooks and code for Neuromorphic Hardware Workshop at ISFPGA 2024.
      Jupyter Notebook
      MIT License
      9000Updated Mar 3, 2024Mar 3, 2024
    • neureka

      Public
      SystemVerilog
      Other
      7000Updated Feb 16, 2024Feb 16, 2024
    • redmule

      Public
      Stata
      Other
      23000Updated Feb 16, 2024Feb 16, 2024
    • SNN-DSE

      Public
      Hardware and software implementation of Sparsely-active SNNs
      SystemVerilog
      5000Updated Dec 28, 2023Dec 28, 2023
    • acceltran

      Public
      [TCAD'23] AccelTran: A Sparsity-Aware Accelerator for Transformers
      Python
      BSD 3-Clause "New" or "Revised" License
      11000Updated Nov 22, 2023Nov 22, 2023
    • Hashed Lookup Table based Matrix Multiplication (halutmatmul) - Stella Nera accelerator
      Python
      MIT License
      14000Updated Nov 20, 2023Nov 20, 2023
    • xla

      Public
      A machine learning compiler for GPUs, CPUs, and ML accelerators
      C++
      Apache License 2.0
      855100Updated Oct 19, 2023Oct 19, 2023
    • nngen

      Public
      NNgen: A Fully-Customizable Hardware Synthesis Compiler for Deep Neural Network
      Python
      Apache License 2.0
      51000Updated Oct 17, 2023Oct 17, 2023
    • A RiscV CPU with an accelerator for accelerating neural networks attached to it
      Verilog
      2000Updated Aug 8, 2023Aug 8, 2023
    • garnet

      Public
      Next generation CGRA generator
      Python
      BSD 3-Clause "New" or "Revised" License
      13000Updated Apr 26, 2023Apr 26, 2023
    • 收集关于嵌入式领域的机器学习算法实现的进展、相关论文和文章、开发库等,帮助初学者快速了解、学习和入门嵌入式领域的机器学习。CC-BY-NC-SA 4.0。
      81000Updated Apr 12, 2023Apr 12, 2023
    • qonnx

      Public
      QONNX: Arbitrary-Precision Quantized Neural Networks in ONNX
      Python
      Apache License 2.0
      59000Updated Feb 27, 2023Feb 27, 2023
    • Block minifloat MAC unit for single block GEMM
      C
      1000Updated Feb 20, 2023Feb 20, 2023
    • The Hardware Impact of Quantization and Pruning for Weights in Spiking Neural Networks
      Python
      4000Updated Feb 9, 2023Feb 9, 2023
    • TENNA

      Public
      TENNA: Tiny Embedded Neural Network Accelerator
      MIT License
      1100Updated Jan 16, 2023Jan 16, 2023
    • CHARM

      Public
      CHARM: Composing Heterogeneous Accelerators for Matrix Multiply on Versal ACAP Architecture (Full Paper accepted to FPGA2023!)
      C++
      MIT License
      24000Updated Jan 12, 2023Jan 12, 2023
    • BARVINN

      Public
      BARVINN: A Barrel RISC-V Neural Network Accelerator: https://barvinn.readthedocs.io/en/latest/
      Tcl
      MIT License
      16000Updated Jan 6, 2023Jan 6, 2023
    • A list of papers, docs, codes about model quantization. This repo is aimed to provide the info for model quantization research, we are continuously improving th…
      240000Updated Dec 17, 2022Dec 17, 2022
    • iob-versat

      Public template
      Coarse Grained Reconfigurable Array
      Verilog
      MIT License
      15000Updated Dec 7, 2022Dec 7, 2022
    • Magicube

      Public
      Magicube is a high-performance library for quantized sparse matrix operations (SpMM and SDDMM) of deep learning on Tensor Cores.
      C++
      GNU General Public License v3.0
      16000Updated Nov 23, 2022Nov 23, 2022
    • RKNN from Rockchip
      Python
      BSD 3-Clause "New" or "Revised" License
      187000Updated Nov 21, 2022Nov 21, 2022
    • antares

      Public
      Antares: an automatic engine for multi-platform kernel generation and optimization. Supporting CPU, CUDA, ROCm, DirectX12, GraphCore, SYCL for CPU/GPU, OpenCL f…
      Python
      Other
      48100Updated Nov 15, 2022Nov 15, 2022
    • This is a list of awesome edgeAI inference related papers.
      9100Updated Nov 8, 2022Nov 8, 2022
    • nitta

      Public
      NITTA - Tool for Hard Real-Time CGRA Processors
      Haskell
      BSD 3-Clause "New" or "Revised" License
      10000Updated Nov 2, 2022Nov 2, 2022
    • braggHLS

      Public
      PyTorch model to RTL flow for low latency inference
      SystemVerilog
      MIT License
      12600Updated Sep 4, 2022Sep 4, 2022
    • gamma

      Public
      Python
      MIT License
      19000Updated Sep 4, 2022Sep 4, 2022
    ProTip! When viewing an organization's repositories, you can use the props. filter to filter by custom property.