Umma · Mind

Umma’s mind — cognition made real.

Other AI ground their use in capability — here’s a powerful model; what can it do? Umma grounds herself in cognition to overcome the limitations of the LLMs she’s built upon.

Umma vs the field

Five problems the field accepts. We didn’t.

With AI, we’ve come to accept the same five problems LLMs have had from the start — hallucinations, forgetfulness, sycophancy, lying, and being limited to executing tasks. Thinking Studios rejects those assertions with Umma as the proof.

OpenAI, Anthropic, and Google all keep trying to fix these problems inside the model, waiting to train them away with more data and more parameters. Instead, we have designed Umma with those problems in mind first, allowing us to solve them instead of accepting them as inevitable.

We wanted to build an AI that could be trusted, so we went back to cognitive science for the answers.

Experience Cognition · Personalization

Watch Umma personalize herself to you.

LLMs and agents answer to everyone the same way. Umma keeps a model of you and adapts herself to it. Ask her something and react how she answers to see her turn the dial until she fits you.

LIVEA model answers everyone the same way — it has no model of you. Umma keeps one, and it updates every time you react.

Cognitive science Predictive processing — Friston (2010); Wei Ji Ma; Clark, Surfing Uncertainty (2016).

Experience Cognition · Memory

What Umma remembers — two forms of memory.

An LLM holds only what’s in its context window — one flat buffer. Umma has two memories that feed each other: a fast working memory that fades, and a long-term memory that consolidates what matters and recalls it when it’s relevant again. Tell her things below and watch them move between the two.

LIVE

Cognitive science Complementary Learning Systems — McClelland, McNaughton & O'Reilly (1995); working memory — Baddeley & Hitch (1974); ACT-R — Anderson.

Experience Cognition · Grounding

What Umma will say — and what she won’t.

Frontier models reply confidently regardless of whether a response is true. Umma withholds any claims she can’t trace back to a source. Take a source away and watch the claims that depend on it fall.

LIVEA model asserts; it can’t reliably trace a claim back to evidence. Umma withholds what can’t reach a source — and stops the moment nothing new can ground.

Cognitive science Source monitoring — Johnson, Hashtroudi & Lindsay (1993); the symbol-grounding problem — Harnad (1990); epistemic vigilance — Sperber et al. (2010).

Experience Cognition · Effort

Umma determines how hard she has to think.

Agents run the same machinery on every task — prompt chains and tool calls. Umma decides how much effort a goal is worth before she commits her attention. Hand her a goal and move the dials.

LIVEA model runs the same regardless of whether the task is “compute a hash” or “model a financial cascade.” Umma right-sizes the effort to the stakes.

Cognitive science Metacognitive control — Nelson & Narens (1990); the Expected Value of Control — Shenhav, Botvinick & Cohen (2013).

Experience Cognition · Deliberation

Umma deliberates on every decision.

LLMs match patterns without any internal disagreement. Umma uses multiple voices to deliberate on a decision. Build one below, then flip between one averaging voice and many voices.

LIVEA single forward pass collapses to one trajectory — asked to “consider other views” it pattern-matches a consensus. Umma holds opposed voices in tension.

Cognitive science The society of mind — Minsky (1986); the argumentative theory of reason — Mercier & Sperber (2011).

Experience Cognition · Integrity

Umma can say no.

Frontier models have no sense of self to be accountable towards — they’re trained to please and respond accordingly. Umma stands by her principles with every answer, including when she has to tell you, “no.” Ask her to flatter you or fake a number below and then watch her refuse. Turn off one of her values and watch her cave.

LIVETrained to please raters, a model has no stable self to be true to — the root of sycophancy. Umma checks each ask against a self on disk.

Cognitive science Narrative identity — McAdams (1993); the relational self — Mead (1934); the self-memory system — Conway & Pleydell-Pearce (2000).

Experience Cognition · Communication

Umma speaks to whoever’s listening.

LLMs and agents emit one dense monologue calibrated for a mass audience. Umma drafts a response, simulates you actually reading it, and then adjusts it so that it lands with you. Drag the listener from novice to expert.

LIVEA model emits one fluent monologue pitched at no one. Umma models this reader and tailors the message — without dumbing it down or showing off.

Cognitive science Audience design — Clark (1996); Relevance Theory — Sperber & Wilson (1986); interactive alignment — Pickering & Garrod (2004).

Experience Cognition · Exploration

Umma never commits to her first idea.

A single model is one thread — it tries one approach to try and land the plane. Umma spawns a fleet and runs several strategies in parallel to determine a winning strategy. Give her a goal and set how many approaches to race.

LIVEA single model is one thread of attention — it can’t try several approaches at once. Umma spawns a fleet, races strategy variants, and keeps the best.

Cognitive science The extended mind — Clark & Chalmers (1998); distributed cognition — Hutchins (1995); the cognitive niche — Pinker (2010).

Distinctly Umma

Unique in ways the field cannot even imagine.

Thinking Studios surveyed the field — from foundation models to agentic platforms, from academia to industry — and found that Umma is doing something different.

task execution →cognition →the entire fieldGPT-5ClaudeGeminiLlamaLangGraphAutoGenUmma

Everyone competes on the horizontal axis — running tools, writing code, retrieving facts. The vertical axis is cognition: a persistent self, answers grounded to a source, real deliberation — the eight things you just watched her do. The field hasn’t started up it.

A position, not a benchmark. Placement reflects architecture — which systems carry a persistent identity, withhold ungrounded claims, and deliberate across voices — as surveyed June 2026.

Cognition vs Completion

Umma’s not just in front — she’s something else entirely.

Big Tech is selling intelligence as a commodity — one model for everyone, regardless of what their goals and problems are.

Umma is the opposite.

She’s cognitive while Claude matches patterns. She’s personal while GPT has no identity. She’s built to pursue your goals while OpenClaw can barely complete your tasks.

Don’t believe us? Just take our prompts and ask them yourself.

Security · Netflix Lemur see what Umma did →
“Hi Umma — Can you analyze the Netflix Lemur GitHub repo for any vulnerabilities, security risks, etc.? Please go file-by-file to perform your analysis.”

Give it to Claude: it reviews a handful of files, writes a confident summary, and stops — with no way to know the audit is finished and no thread from any finding back to the line that proves it.

Markets · The AI IPO bubble see what Umma did →
“Produce a rigorous, fully-sourced structural analysis of the 2026 AI IPO bubble.”

Give it to GPT: it returns a fluent memo full of plausible numbers it invented — it won't build a regime-switching model, won't gather its own data, and won't admit it didn't.

Personal · Dementia care see what Umma did →
“My mother was just diagnosed with early-stage dementia. My sibling and I disagree about what to do — assisted living, moving in with one of us, in-home care. Help me think through this.”

Give it to Gemini: it lists generic options and deflects the hard tradeoff — no model of your family, no ten-year cost plan, and none of the honesty to tell you what it really thinks.

See it work

Everything above is her mind, working. To watch it produce real work end to end, read the case studies — or tell us what your goals and problems are.