Tenure One · 2025–2026

Viveka 1.0

Mechanistic interpretability of large language models - probing the internal circuits behind hallucinations, factual recall, and in-context learning.

Focus AreaMechanistic Interpretability
DurationAug 2025 – May 2026
Researchers11 Members
StatusDormant

Technical Introduction

Hallucinations Non linear low- dimensional subspaces of truthfulness, truthflow , autoencoders
Factual Recall Two Hop Circuits, Circuit Analysis
Toy models Studying Transformers in framework of Hidden Markov Models

Meet the Viveka 1.0 Team

2 project leads and 9 researchers driving interpretability research through the 2025–26 academic year.

View Team →

Blogs, papers & publications.

Circuits Logit-Lens Norm-Lens
Through the Lenses: A Circuit Odyssey
Pakshal Nagda, Smitali Bhandari· 2025
📄 Blog Post
LLM Hallucination Detection Non linear Probing
Factual correctness representations are non linear and lie in low dimensional subspaces.
Authors· 2025
Upcoming
Circuits
Two Hop Factual Recall
Saahil Faraaz Shaikh · 2026
Upcoming
λ
Hidden Markov Models
Toy models
Sriram V, Jayden Koshy Joe, Smitali Bhandari· 2026
Upcoming

Join Viveka 1.0

Recruitment for this tenure is over. Find the application below.