Training a 360M Parameter Model with Performance Discipline
Pretraining SmolLM-360M on a single A100 GPU within a 30-hour window, focusing on feasibility analysis, throughput measurement, and hardware efficiency optimization.
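As a rough feasibility sketch (the numbers here are assumptions, not measured results from this project): taking the A100's ~312 TFLOPS BF16 tensor-core peak, the standard 6·N·D FLOPs-per-token heuristic, and an illustrative 40% model FLOPs utilization, the token budget reachable in 30 hours can be estimated as follows.

```python
# Back-of-the-envelope feasibility check: how many training tokens can a
# 360M-parameter model see in 30 hours on a single A100?
# All constants below are assumptions for illustration, not measured values.

A100_PEAK_FLOPS = 312e12   # A100 BF16 tensor-core peak (dense), FLOPs/s
ASSUMED_MFU = 0.40         # illustrative model FLOPs utilization
WALL_CLOCK_S = 30 * 3600   # 30-hour training budget, in seconds
N_PARAMS = 360e6           # SmolLM-360M parameter count

# Standard heuristic: ~6 FLOPs per parameter per training token
flops_budget = A100_PEAK_FLOPS * ASSUMED_MFU * WALL_CLOCK_S
tokens = flops_budget / (6 * N_PARAMS)

print(f"Usable FLOPs budget:      {flops_budget:.2e}")
print(f"Approx. trainable tokens: {tokens:.2e}")  # ~6e9 tokens at these assumptions
```

For reference, the common ~20-tokens-per-parameter rule of thumb would call for roughly 7B tokens for a 360M-parameter model, so at good utilization a 30-hour single-A100 run lands in the right ballpark; the achieved MFU is what the throughput measurements are meant to pin down.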