← Back to Home

Projects

Systems. Robotics. Hardware. AI

TinyEdge: ML Compiler →

An end-to-end ML compiler that lowers MobileNetV2 through 4 IR layers with custom CUDA kernels, delivering a 4.67× inference speedup over PyTorch eager.

CUDA C++ PyTorch TVM MLIR LLVM

Advanced Out-of-Order RISC-V Processor →

Developed a high-performance RISC-V processor featuring advanced pipelining, branch prediction, and caching mechanisms.

RISC-V SystemVerilog Processor Design Computer Architecture

Enfield Engine →

A high-performance, multi-threaded game engine crafted in C++ with Lua scripting and advanced physics.

C++ Game Engine Box2D SDL Lua ImGui

Roll Over Beethoven: A Cloth Piano →

A portable cloth piano using capacitive cloth and pressure-sensitive gloves.

C Embedded Systems Audio Synthesis Real-time Processing

Robotics →

Algorithms for motion planning, kinematics, and control systems in robotics.

Robotics Path Planning Kinematics Control Systems PyTorch

Operating Systems →

Custom system calls, Unix shell, and memory management in C++.

Operating Systems C++ Linux Kernel

Deep Learning →

Implementation of neural networks and reinforcement learning algorithms in Python.

Deep Learning Neural Networks Computer Vision PyTorch Transformers

Networking →

Video CDN implementation and network diagnostic tools.

Networking Distributed Systems C++ C