Why it matters
Too much of the market still treats evaluation, interpretability and reliability as secondary concerns. That works until teams need to confidently choose a model, mitigate hallucinations or explain how an AI decision was reached.
Our approach
We tackle these challenges from the ground up. Using a first-principles approach to evaluation, we build methods that rigorously assess AI systems at every stage: input, output and internal decision-making.
Blog
p-less Sampling: A robust LLM decoding strategy
In this research paper, we introduce a novel approach of sampling named 𝜌-less sampling for text generation in autoregressive models.
Blog
The next frontiers in AI — according to industry leaders
This research paper explores the key trends we think are particularly important in generative AI
From the labs
P-less Sampling: A robust hyperparameter-free approach for LLM decoding
p-less Sampling: A robust LLM decoding strategy
Evaluating LLM-generated summaries using the Lie algebra framework
The next frontiers in AI — according to industry leaders
Calculating uncertainty in generative AI
Evaluating LLMs using semantic entropy
LLM benchmarks, evals and tests
Decoding LLM uncertainties for better predictability
A surprisingly effective way to estimate token importance in LLM prompts
Probabilistic machine learning and weak supervision
A gentle introduction to machine teaching
TinySQL
Beyond linear steering: Unified multi-attribute control for language models
Turning up the heat: Min-p samling for creative and coherent creative outputs
Beyond I am sorry, I can’t: dissecting large language model refusal
Distribution-aware feature selection for SAEs
Towards transparent AI grading: Entropy as a signal for human-AI disagreement
Steering smarter
Partners and collaborations
Thoughtworks AI labs sit within a wider network of organizations spanning public AI research, semiconductor innovation, cloud platforms, open source and AI engineering.
These relationships strengthen the lab’s ability to contribute to the methods, tools and technical standards shaping reliable AI.
For partnerships and collaboration inquiries
email ai-labs@thoughtworks.com