Agent Evaluation Benchmark

Agent Performance & Evaluation
Github

04-2026

End-to-end benchmarking system for evaluating LLM agents on reasoning, tool use, and accuracy

Project media 1
Project media 2
Project media 3

Multi-Step Agent System

Agent Systems & Applied AI
Github

02-2026

LangGraph-based multi-step reasoning agent with tool orchestration and memory

Project media 1
Project media 2
Project media 3

Model Serving & Inference System

Model & Infrastructure
Github

01-2026

vLLM-based high-performance inference stack with batching, streaming, and latency optimization

Project media 1
Project media 2
Project media 3

Model Adaptation Pipeline

Model & Infrastructure
Github

12-2025

LoRA-based fine-tuning, distillation, and post-training pipeline for domain-specific LLMs

Project media 1
Project media 2
Project media 3

ChunkViz

Open Source
Github

08-2025

LLM Tooling & Visualization

Project media 1
Project media 2
Project media 3

An open-source 3D Gaussian Splatting platform

Project media 2
Project media 3

SearchAI

Open Source
Github

08-2024

buildspace s5 - A Feature-Rich PyQt5 Browser with Integrated AI

Project media 1
Project media 2
Project media 3

Your Personalized Mental Health Companion

Project media 1
Project media 2
Project media 3