Domains · where we collaborate, in priority order
Safety
Alignment and oversight. Defense evals fit here. The first call on everything.
Defense
High-stakes capability and red-teaming, including weapons-capability red-teaming.
Science
Bio, pharma, clinical-trials automation, and fundamental research.
Commerce
Indexing workflows from real companies. What we are doing now.
Blog · research notes and method write-ups
Shelf Life
Representing a problem space in thirty pages.
Environments under RL
What our environments do to models trained on them with RL.
Dense reward
Why step-by-step grading beats pass/fail grading.