Projects

Report on how Transformers and some Large Language Models (LLMs) work

This report focuses mainly on the architecture of Transformers, motivated by my fine-tuning project.
Limitations: The report does not cover multimodal features or reasoning abilities obtained through reinforcement learning.

Tutorial: How to run any lightweight LLM locally on an Android phone

A step-by-step guide to running lightweight LLMs locally on Android using Termux. It also explains the limitations of iOS support.

DeepSeek-R1 Fine-Tune on a Reasoning Dataset and Benchmark

A detailed report on fine-tuning the DeepSeek-R1 model on a reasoning dataset, including performance benchmarks and insights into model improvements.
The fine-tuning and benchmarking methods are posted on GitHub. Slides tracking the project timeline and results are also attached.

My Animals Classifier (Convolutional Neural Network)

An implementation of a convolutional neural network that classifies various animals. The project covers dataset preparation, model architecture, and performance evaluation.

OpenCV Object Detection

This project explores real-time object detection using OpenCV. It includes details on setup and algorithm selection (Haar cascades and their variations are used).
The GitHub repository also includes some basic operations and manipulations on images and videos.

Autonomous Car

An experimental project focused on developing an autonomous car system, on which I collaborated with three teammates. The project demonstrates autonomous decision-making and sensor integration under the constraint that we were given only two sensors.

Multimodal LLMs (Work in Progress)

A work-in-progress project exploring the integration of multimodal features into large language models, aiming to enhance user interaction with both text and visual data, especially images.