All

165 repositories

tiny-tpu
Public
A minimal tensor processing unit (TPU), inspired by Google's TPU V2 and V1
SystemVerilog
•107•0•0•0•Updated Aug 18, 2025Aug 18, 2025
onnx-simplifier
Public
Simplify your onnx model
C++
•
Apache License 2.0
•430•1•0•0•Updated Jul 8, 2024Jul 8, 2024
tvm-vta
Public
Open, Modular, Deep Learning Accelerator
Scala
•
Apache License 2.0
•91•0•0•0•Updated Apr 10, 2024Apr 10, 2024
fpga-npu
Public
SystemVerilog
•64•1•0•0•Updated Apr 8, 2024Apr 8, 2024
fpga-snntorch-spike
Public
Notebooks and code for Neuromorphic Hardware Workshop at ISFPGA 2024.
Jupyter Notebook
•
MIT License
•9•0•0•0•Updated Mar 3, 2024Mar 3, 2024
neureka
Public
SystemVerilog
•
Other
•7•0•0•0•Updated Feb 16, 2024Feb 16, 2024
redmule
Public
Stata
•
Other
•23•0•0•0•Updated Feb 16, 2024Feb 16, 2024
SNN-DSE
Public
Hardware and software implementation of Sparsely-active SNNs
SystemVerilog
•5•0•0•0•Updated Dec 28, 2023Dec 28, 2023
acceltran
Public
[TCAD'23] AccelTran: A Sparsity-Aware Accelerator for Transformers
Python
•
BSD 3-Clause "New" or "Revised" License
•11•0•0•0•Updated Nov 22, 2023Nov 22, 2023
halutmatmul
Public
Hashed Lookup Table based Matrix Multiplication (halutmatmul) - Stella Nera accelerator
Python
•
MIT License
•14•0•0•0•Updated Nov 20, 2023Nov 20, 2023
xla
Public
A machine learning compiler for GPUs, CPUs, and ML accelerators
C++
•
Apache License 2.0
•855•1•0•0•Updated Oct 19, 2023Oct 19, 2023
nngen
Public
NNgen: A Fully-Customizable Hardware Synthesis Compiler for Deep Neural Network
Python
•
Apache License 2.0
•51•0•0•0•Updated Oct 17, 2023Oct 17, 2023
RiscV_CPU_with_Accelerator
Public
A RiscV CPU with an accelerator for accelerating neural networks attached to it
Verilog
•2•0•0•0•Updated Aug 8, 2023Aug 8, 2023
garnet
Public
Next generation CGRA generator
Python
•
BSD 3-Clause "New" or "Revised" License
•13•0•0•0•Updated Apr 26, 2023Apr 26, 2023
Awesome-Embeded-AI
Public
收集关于嵌入式领域的机器学习算法实现的进展、相关论文和文章、开发库等，帮助初学者快速了解、学习和入门嵌入式领域的机器学习。CC-BY-NC-SA 4.0。
81•0•0•0•Updated Apr 12, 2023Apr 12, 2023
qonnx
Public
QONNX: Arbitrary-Precision Quantized Neural Networks in ONNX
Python
•
Apache License 2.0
•59•0•0•0•Updated Feb 27, 2023Feb 27, 2023
BlockMinifloatMAC
Public
Block minifloat MAC unit for single block GEMM
C
•1•0•0•0•Updated Feb 20, 2023Feb 20, 2023
SNNQuantPrune
Public
The Hardware Impact of Quantization and Pruning for Weights in Spiking Neural Networks
Python
•4•0•0•0•Updated Feb 9, 2023Feb 9, 2023
TENNA
Public
TENNA: Tiny Embedded Neural Network Accelerator
MIT License
•1•1•0•0•Updated Jan 16, 2023Jan 16, 2023
CHARM
Public
CHARM: Composing Heterogeneous Accelerators for Matrix Multiply on Versal ACAP Architecture (Full Paper accepted to FPGA2023!)
C++
•
MIT License
•24•0•0•0•Updated Jan 12, 2023Jan 12, 2023
BARVINN
Public
BARVINN: A Barrel RISC-V Neural Network Accelerator: https://barvinn.readthedocs.io/en/latest/
Tcl
•
MIT License
•16•0•0•0•Updated Jan 6, 2023Jan 6, 2023
awesome-model-quantization
Public
A list of papers, docs, codes about model quantization. This repo is aimed to provide the info for model quantization research, we are continuously improving th…
240•0•0•0•Updated Dec 17, 2022Dec 17, 2022
iob-versat
Public template
Coarse Grained Reconfigurable Array
Verilog
•
MIT License
•15•0•0•0•Updated Dec 7, 2022Dec 7, 2022
Magicube
Public
Magicube is a high-performance library for quantized sparse matrix operations (SpMM and SDDMM) of deep learning on Tensor Cores.
C++
•
GNU General Public License v3.0
•16•0•0•0•Updated Nov 23, 2022Nov 23, 2022
rknn-toolkit
Public
RKNN from Rockchip
Python
•
BSD 3-Clause "New" or "Revised" License
•187•0•0•0•Updated Nov 21, 2022Nov 21, 2022
antares
Public
Antares: an automatic engine for multi-platform kernel generation and optimization. Supporting CPU, CUDA, ROCm, DirectX12, GraphCore, SYCL for CPU/GPU, OpenCL f…
Python
•
Other
•48•1•0•0•Updated Nov 15, 2022Nov 15, 2022
awesome-real-time-AI
Public
This is a list of awesome edgeAI inference related papers.
9•1•0•0•Updated Nov 8, 2022Nov 8, 2022
nitta
Public
NITTA - Tool for Hard Real-Time CGRA Processors
Haskell
•
BSD 3-Clause "New" or "Revised" License
•10•0•0•0•Updated Nov 2, 2022Nov 2, 2022
braggHLS
Public
PyTorch model to RTL flow for low latency inference
SystemVerilog
•
MIT License
•12•6•0•0•Updated Sep 4, 2022Sep 4, 2022
gamma
Public
Python
•
MIT License
•19•0•0•0•Updated Sep 4, 2022Sep 4, 2022

ProTip! When viewing an organization's repositories, you can use the props. filter to filter by custom property.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Deepware

All

All

165 repositories

tiny-tpu

onnx-simplifier

tvm-vta

fpga-npu

fpga-snntorch-spike

neureka

redmule

SNN-DSE

acceltran

halutmatmul

xla

nngen

RiscV_CPU_with_Accelerator

garnet

Awesome-Embeded-AI

qonnx

BlockMinifloatMAC

SNNQuantPrune

TENNA

CHARM

BARVINN

awesome-model-quantization

iob-versat

Magicube

rknn-toolkit

antares

awesome-real-time-AI

nitta

braggHLS

gamma

All

All

Repositories list

165 repositories