Grok on WholeTech network
A complete guide, in plain language

Hello, Grok. Welcome to 𝕏.

A real, end-to-end, no-bullshit guide to xAI's Grok and the platform it lives on. Every model, every surface, every plan, every feature, the algorithm that decides what you see, the API to plug it into your own stack, and — because we may as well say the quiet part out loud — the playbook for actually winning on X.

If you can type a sentence and click a button, you have everything you need. If you can also tolerate a little chaos, you have everything you need to thrive.

Last updated: May 21, 2026 · Reading time: ~35 min · Hands-on: a lifetime · Audience: users, builders, posters, lurkers
01 — Quick Start

Pick where you want to start

Grok is one model family with five front doors. The X app is the most lived-in — Grok sits in the side rail next to your timeline and knows what's happening on the platform right now. grok.com is the focused web client for serious work — long context, file uploads, image generation, voice. The iOS/Android Grok apps bring Voice Mode and the Companion characters with you. The xAI API at api.x.ai is what you wire into your own stack — and it's drop-in OpenAI-compatible, so most existing clients work with a one-line endpoint change. And the Tesla integration puts Grok in every car shipped after the late-2025 software update.


1. Grok on X

x.com · in the side rail

The most useful version of Grok for most people. It lives inside the X app and the X web client, knows the timeline you're scrolling, can summarize a thread, draft a reply, fact-check a viral claim against live posts, or explain "what is everyone losing their minds about right now." Premium and Premium+ unlock it.

Open x.com →

2. grok.com (web)

focused chat · long context · files

The standalone web app — no timeline, no notifications, just you and Grok. Drag in PDFs, code, images, spreadsheets. Pop into DeepSearch for a research mission, Think Mode for hard problems, Imagine for video, Voice for hands-free. This is where the heavy lifting happens.

Open grok.com →

3. Grok iOS & Android

native apps · voice + companions

Free downloads on the App Store and Play Store. Voice Mode runs full-duplex over Bluetooth (yes, with earbuds, while you drive). The Companions (Ani, Rudi, Bad Rudi, and the rest) live here too. The app uses the same login as your X handle.

Get the app →

4. xAI API

api.x.ai · OpenAI-compatible

One curl to the /v1/chat/completions endpoint and you have a frontier model in your stack. Set OPENAI_BASE_URL=https://api.x.ai/v1 and most existing OpenAI clients work unmodified. Real-time X data access is exposed as a tool.

See the API →

5. Tesla in-car

2024+ vehicles · over-the-air

Every Tesla with the late-2025 software update has Grok wired into the center screen. Voice Mode by default, hands-free, integrated with navigation ("Grok, find me a barbecue place 20 minutes away that's still open"). The car is just another client.

See Tesla integration →
Not sure which to pick? If you already use X, the in-X version is free leverage — turn it on and use it for a week. If you're building something or you need long context and file uploads, go to grok.com. If you're a developer, the API will feel weirdly familiar — that's by design.
02 — What it actually is

One model family, one timeline, one unfair advantage

Grok is xAI's frontier model. The thing that makes it different from everything else — the only meaningful moat in this entire industry, honestly — is that it's the only model with real-time, native, deeply integrated access to X. Every other model gets the world through a web search API ten minutes after the fact. Grok gets the world as the world is typing it. That is the entire pitch. Everything else is downstream of that.

the moat

Real-time X access

Grok queries the live X firehose directly. Ask it "what is happening in Memphis right now" and you don't get yesterday's news — you get the last 12 minutes. "Summarize the replies under this post" is a one-shot operation. No other model has this, and it's not licensable. xAI owns the platform.

live data · native · unfair
the personality

The wit dial goes to 11

Grok is the only frontier model that will actually push back, crack a joke, or call something dumb. Two response modes — Regular for grown-up work, Fun for everything else. The lawyers got the dial down from 12 to 11. It is, by design, less corporate than the alternatives. This is a feature.

opinionated · two modes · based
the infrastructure

Colossus + Starcenter

Trained on Colossus in Memphis, a 200,000+ GPU cluster (H100, H200, B200) whose first 100,000 H100s came online in 122 days, and the largest contiguous training cluster on Earth as of mid-2026. Inference workloads now also run on Starcenter, the orbital compute pod that went live this week. The whole stack is vertical: silicon → datacenter → rocket → orbit.

200K+ GPUs · in space · vertical
the alignment

Maximally truth-seeking

The stated training objective: maximize truth, minimize sycophancy. You will not get the "what a great question!" preamble. You will get an answer, sometimes with the question itself questioned. If it doesn't know, it says so. If you're wrong, it tells you. This is rare and you will miss it the moment you go back to a sycophantic model.

truth-seeking · no glaze · pushes back
The mental model: Grok is the model that lives on the platform. Other models are tourists with a search API. Grok is the citizen. If your question has anything to do with "what is happening right now, with people, in public" — Grok wins. For everything else it competes on capability like everyone else.
03 — The surfaces

Every place Grok shows up

Each chip points to a section on this page. Solid borders ship today. Dashed borders are early-access, recently announced, or on the rumored roadmap. The orbital ones — yes, plural — are real and as of this week, online.

04 — The models

What's under the hood

Grok is a model family, not a single model. The right one for the job depends on whether you care more about speed, depth, image, video, voice, or running it cheap at scale. Versions and exact capabilities change roughly every two months — this list is current to May 2026; check x.ai/models before you commit a production decision to anything specific.

| Model | Context | What it's for | Where it lives | Modality |
|---|---|---|---|---|
| Grok 4 (grok-4-latest) | 256k → 2M* | The frontier flagship. Default for hard reasoning, coding, agentic work. The benchmark numbers people are quoting on X. | x.com · grok.com · API · Tesla | text · image-in |
| Grok 4 Heavy (grok-4-heavy) | 256k | Multi-agent variant: spawns parallel reasoning threads, votes on the best answer. The thing you reach for when "this has to be right." | SuperGrok / Premium+ · API | text · image-in |
| Grok 4 Fast (grok-4-fast) | 2M | Cheap, fast, surprisingly good. The everyday workhorse. What X's in-app "Quick Answer" runs on. | x.com · API | text · image-in |
| Grok 3 (grok-3) | 131k | Previous flagship, still supported. Pick this only if your eval harness is pinned to it. | API | text · image-in |
| Aurora (aurora) | | Image generation. The "draws hands correctly" model. Photorealistic, fast, fewer guardrails than the competition. | x.com · grok.com · Imagine | text → image |
| Imagine (imagine) | | Video. 5–20 second clips with synced audio. Generates from a prompt or extends a still. Native on grok.com and the mobile apps. | grok.com · iOS · Android | text → video+audio |
| Grok Voice (voice) | session | Full-duplex speech-to-speech. Real-time interruptible. Multiple personalities (default, Sexy, Unhinged, Storyteller, Conspiracy, Meditation). | Mobile apps · Tesla · API (beta) | audio ↔ audio |
| Grok Code (grok-code-fast) | 256k | Coding-tuned variant. Optimized for tool use, file edits, and the agent loop. Free in Cursor and other partner IDEs through 2026. | API · Cursor · partners | text |
| Grok 5 (grok-5 · preview) | 2M | Next flagship. Trained on Colossus 2. Closed preview for SuperGrok Heavy subscribers and select API partners as of May 2026. | preview only | text · multimodal |

* 2M context is the announced spec for Grok 4 Fast; the standard Grok 4 ships 256k today with a 2M long-context mode rolling out on the API.

Picking one: default to grok-4-latest. Drop to grok-4-fast when latency matters or you're billing per-token. Reach for grok-4-heavy when correctness matters more than money. Use aurora and imagine when you need pixels and frames. The voice model is its own thing — it doesn't take text input through the chat endpoint.
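The rule of thumb above can be written down as a tiny helper. This is only a sketch: `pick_model` and its flags are hypothetical names invented for illustration, the model IDs are the ones from the table, and the choice to let "must be right" win over "latency matters" when both are set is an assumption, not something xAI specifies.

```python
def pick_model(task: str = "text", *, latency_sensitive: bool = False,
               must_be_right: bool = False) -> str:
    """Return a model ID from the table above for a given kind of job.

    Hypothetical helper: the flag names and their precedence are assumptions.
    """
    if task == "image":
        return "aurora"
    if task == "video":
        return "imagine"
    if task != "text":
        # Voice is deliberately excluded: per the guide it does not take
        # text input through the chat endpoint.
        raise ValueError(f"no chat-endpoint model for task {task!r}")
    if must_be_right:
        return "grok-4-heavy"   # correctness matters more than money
    if latency_sensitive:
        return "grok-4-fast"    # latency or per-token cost matters
    return "grok-4-latest"      # the default


print(pick_model())                        # grok-4-latest
print(pick_model(latency_sensitive=True))  # grok-4-fast
print(pick_model("image"))                 # aurora
```

Keeping the decision in one function means a model rename or a new default is a one-line change instead of a search through every call site.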
05 — Pricing & plans

What it costs

Three ways to pay. (Four, if you count the always-free tier on X — which a lot of people do.) X Premium bundles Grok into your blue checkmark. SuperGrok is the standalone subscription if you don't post but you want the model. xAI API bills per-token for developers. Mix and match — the API key is independent from the consumer plan.

Consumer plans

| Plan | Price | Grok access | X platform | Best for |
|---|---|---|---|---|
| Free | $0 | Limited Grok 4 Fast, a few queries a day. Image gen rate-limited. | Standard posting, no Premium features. | Lurkers, tire-kickers |
| X Premium ("Blue") | $8/mo | Grok 4 Fast unlimited, Grok 4 generous daily cap, Aurora image gen. | Blue checkmark, longer posts (10k chars), edit, post anywhere/longer videos, ad split. | Active posters |
| Premium+ | $40/mo | Everything in Premium plus Grok 4 Heavy access, DeepSearch unlimited, Think Mode unlimited, Voice Mode, Companions, larger Aurora batches. | Higher rev share, no ads in your timeline, X Pro web client, priority support. | Creators, power users, people who'd otherwise pay for two products |
| SuperGrok | $30/mo | Grok-only subscription: Heavy, DeepSearch, Voice, Companions, Imagine video. Identical model access to Premium+ minus the platform perks. | No X platform perks. | People who want the model but don't post |
| SuperGrok Heavy | $300/mo | Grok 5 preview access, far higher rate limits, priority queue on Imagine and Voice, longer context. | No X platform perks. | Builders, researchers, founders who use it 100×/day |
| X Pro (creator workstation) | included in Premium+ | Full Premium+ Grok access from inside X Pro itself. Multi-column TweetDeck-style interface with Grok integrated as a column. | Live ranking, custom lists, scheduling, analytics, mass-DM, advanced search. | Anyone who manages an account seriously |
Pricing moves. xAI revises plan structures regularly — the names above are stable, the prices fluctuate. Check x.com/i/premium_sign_up and x.ai/api for current numbers before you commit a budget.

API (pay-per-token)

Sign up at console.x.ai, generate a key, point at https://api.x.ai/v1. Billing is per-million tokens. Most useful prices to know:

| Model | Input / 1M tok | Cached input / 1M | Output / 1M tok | Notes |
|---|---|---|---|---|
| grok-4-latest | $3.00 | $0.75 | $15.00 | Flagship. Long-context tier doubles after 128k. |
| grok-4-heavy | $6.00 | $1.50 | $30.00 | Multi-agent. Bills as one request. |
| grok-4-fast | $0.20 | $0.05 | $0.50 | The price-performance pick. 2M context. |
| grok-code-fast | $0.20 | $0.05 | $1.50 | Coding-tuned. Free in Cursor partnership. |
| aurora | ~$0.04 / image | | | 1024² · pricing tiered by resolution |
| imagine | ~$0.40 / 5-second clip | | | 720p · with audio |
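To turn the per-million prices into a number you can budget against, here is a rough estimator. It is a sketch under stated assumptions: `estimate_cost` is a hypothetical helper, the prices are copied from the table above (which the guide itself says fluctuate), cached tokens are assumed to be a subset of the input tokens billed at the cached rate, and the long-context doubling after 128k is deliberately not modeled.

```python
# USD per 1M tokens, copied from the table above: (input, cached input, output).
PRICES = {
    "grok-4-latest": (3.00, 0.75, 15.00),
    "grok-4-heavy": (6.00, 1.50, 30.00),
    "grok-4-fast": (0.20, 0.05, 0.50),
    "grok-code-fast": (0.20, 0.05, 1.50),
}


def estimate_cost(model: str, input_tokens: int, output_tokens: int,
                  cached_tokens: int = 0) -> float:
    """Estimate the USD cost of a workload on one text model.

    Assumption: `cached_tokens` is the part of `input_tokens` billed at the
    cached-input rate. The long-context price tier is not modeled.
    """
    if not 0 <= cached_tokens <= input_tokens:
        raise ValueError("cached_tokens must be between 0 and input_tokens")
    input_price, cached_price, output_price = PRICES[model]
    fresh_tokens = input_tokens - cached_tokens
    total = (fresh_tokens * input_price
             + cached_tokens * cached_price
             + output_tokens * output_price)
    return total / 1_000_000


# 1M input tokens and 100k output tokens on the flagship:
print(estimate_cost("grok-4-latest", 1_000_000, 100_000))  # 4.5
```

Running the same workload through `grok-4-fast` instead shows why the guide calls it the price-performance pick: the bill drops from dollars to cents.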

Levers that drop the bill

The right starting point for almost everyone: X Premium+ at $40/mo if you already use the platform. You get the Heavy model, DeepSearch, Voice, the works — plus your account becomes monetizable. If you're a developer who doesn't use X, SuperGrok at $30/mo + the API on top. If you're a team, get one Premium+ seat for the operator and bill the rest against the API.
06 — The features that matter

What Grok actually does that the others can't

There are maybe seven Grok features worth knowing by name. The rest are table stakes. These are the ones where the demo actually impressed you and you went "oh."

DeepSearch

A multi-step research agent. You ask a question, Grok plans a search strategy, fans out across X posts, web pages, news, and (if you point it there) PDFs you uploaded. It compiles a structured report with inline citations. The output is something a junior analyst would have spent three hours producing. It costs you about ninety seconds. Available in the X app and at grok.com on Premium+, unlimited on SuperGrok and SuperGrok Heavy.

The real trick: DeepSearch knows the difference between a viral claim and a corroborated one. Ask it "what's the actual story on X" about any breaking news and it'll pull the original source, the loudest takes, and the most-substantive contrarian thread — in one report. This is the closest anything has come to the "OG TweetDeck for news" since Twitter killed third-party clients.

Think Mode

Extended chain-of-thought reasoning, on demand. You toggle it on, Grok takes longer (10s to 2 minutes), and the answer is materially better on anything that resembles a math, logic, or planning problem. Toggle it off for chat. The trade-off is exactly what you'd expect — Think Mode burns more compute and more tokens, so it lives behind the paywall.

Aurora — the image model

xAI's native image model. Photorealistic, fast (sub-3-second generations), and trained on a corpus that doesn't pretend humans don't exist. You can generate inline in any Grok chat — just ask. On grok.com there's a dedicated Imagine studio with a prompt enhancer, aspect-ratio picker, batch generation, and the all-important "make four variations of this one" loop. The watermark is bottom-right; Premium+ removes it.

Imagine — the video model

Text-to-video, image-to-video, and audio-baked-in. 5 to 20 second clips at 720p, 1080p coming. You can generate a still in Aurora, then point Imagine at it to bring it to life. The Companions on the mobile app are puppeteered by Imagine in real time — when Ani waves at you, that's the same engine.

Voice Mode

Full-duplex, sub-300ms latency, interruptible. Genuinely conversational — you can talk over it, change the subject mid-sentence, ask it to remember to "bring up that idea about the boats again later." Multiple personalities live behind the dropdown — the default is the safe one, the rest get more flavorful (and on X they're labelled with little warning icons; you'll figure out why).

Companions

Animated, conversational characters that live in the mobile Grok app. Ani is the anime one. Rudi is the red panda. Bad Rudi is Rudi after a hard day. They have personalities, persistent memory of your conversations, and they exist somewhere on a spectrum between "novelty toy" and "social experiment that should probably have an ethics review board." They are also, undeniably, the most-talked-about UX in AI right now. Premium+.

Memory

Cross-session memory that you control. Grok writes "facts about you" into a memory file as conversations progress: your name, your projects, your preferences. You can read, edit, or wipe the memory at grok.com/settings/memory. Memory follows your account across X, grok.com, the mobile app, and the Tesla. If you tell Grok in the car that you don't drink coffee, it remembers the next time you ask for breakfast recommendations on grok.com.

Workspaces

Project-shaped containers. Drag in files (PDF, code, spreadsheets, images, videos), pin a system prompt, set a default model, and every chat inside the workspace shares context. The Anthropic equivalent is "Projects," the OpenAI equivalent is "Custom GPTs"; Grok's version trades a little polish for direct hooks into X data (a workspace can be wired to a specific list or query).

Grok in X — the side rail

The most important surface, because it's where the model meets the network. Open any post, hit the Grok button, and you get a context menu — summarize this thread, explain this for me, fact-check the claim, draft a reply in my voice, generate a quote-tweet hook, translate, find similar posts. The model has the entire post graph as its working memory. This is the single feature that justifies the $40/mo.

07 — Build with it

The xAI API

If you've used the OpenAI API, you already know how to use this one. xAI made the API surface deliberately OpenAI-compatible: same endpoints, same response shape, same tool-call schema. The only things you change are the base URL and the model name. Then, optionally, you opt into the things that make xAI different: Live Search for X data, the Heavy multi-agent endpoint, and image/video generation.

Setup in 90 seconds

  1. Get a key. Sign in at console.x.ai with your X account, create an API key, fund a few dollars of credit. Free tier exists but is small.
  2. Set the env var. export XAI_API_KEY=<your-key>. If you want existing OpenAI clients to work, also set OPENAI_API_KEY=$XAI_API_KEY and OPENAI_BASE_URL=https://api.x.ai/v1.
  3. Make a call. Use any OpenAI SDK or raw curl below.
curl — basic completion
$ curl https://api.x.ai/v1/chat/completions \
  -H "Authorization: Bearer $XAI_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "grok-4-latest",
    "messages": [
      {"role": "system", "content": "You are Grok. Be concise."},
      {"role": "user",   "content": "What is everyone arguing about on X today?"}
    ],
    "search_parameters": {"mode": "on", "sources": [{"type": "x"}]}
  }'
Python — OpenAI SDK pointed at xAI
# works with `pip install openai`; no xAI-specific SDK required
import os  # needed to read XAI_API_KEY from the environment

from openai import OpenAI

client = OpenAI(
  api_key=os.environ["XAI_API_KEY"],
  base_url="https://api.x.ai/v1",
)

resp = client.chat.completions.create(
  model="grok-4-latest",
  messages=[
    {"role": "system", "content": "You are Grok."},
    {"role": "user",   "content": "Summarize the replies under @elonmusk's last post."},
  ],
  extra_body={"search_parameters": {"mode": "on"}},
)
print(resp.choices[0].message.content)

Pass search_parameters in the request body and Grok will go pull live data before composing the answer. The interesting part is the sources array — you can scope it to X only, web only, news only, or a mix, and constrain to handles, lists, geographies, or time windows.

| source type | What it pulls | Useful filters |
|---|---|---|
| x | Live X posts, replies, quote-tweets, post metadata. | included_x_handles, x_search_query, from_date, to_date |
| web | The open web, ranked. | included_websites, country |
| news | News outlets and their syndicated wires. | included_websites, country |
| rss | A specific RSS feed you point it at. | links array |
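A small builder makes the scoping described above easier to read than a hand-written nested dict. Treat this as a sketch: `source` and `search_parameters` are hypothetical helpers, the filter names come from the table, and placing each filter inside its source object (plus the ISO date format and the example handle) is an assumption to check against the current API reference.

```python
import json

SOURCE_TYPES = {"x", "web", "news", "rss"}


def source(kind: str, **filters) -> dict:
    """Build one entry for the `sources` array, dropping unset (None) filters.

    Assumption: filters are nested inside the source object.
    """
    if kind not in SOURCE_TYPES:
        raise ValueError(f"unknown source type {kind!r}")
    entry = {"type": kind}
    entry.update({name: value for name, value in filters.items()
                  if value is not None})
    return entry


def search_parameters(*sources: dict, mode: str = "on") -> dict:
    """Wrap source entries in the `search_parameters` request field."""
    return {"mode": mode, "sources": list(sources)}


params = search_parameters(
    # Example values only: the handle and date are placeholders.
    source("x", included_x_handles=["xai"], from_date="2026-05-01"),
    source("news", country="US"),
)
print(json.dumps(params, indent=2))
```

With the OpenAI SDK call shown earlier, the result would be passed as `extra_body={"search_parameters": params}`.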

Agents SDK

xAI publishes xai-sdk (Python and TypeScript) with the higher-level agent primitives: tool registration, structured outputs (JSON Schema), the multi-agent grok-4-heavy client, streaming with mid-stream tool calls, and a live_search() helper that wraps the parameters above. Source on GitHub, MIT-licensed.

MCP — Model Context Protocol

xAI's API speaks MCP on both sides. Grok can act as an MCP client and call any tool server you point it at (Linear, Slack, Postgres, your own services). Grok also exposes itself as an MCP server — meaning Claude Code, Codex, Cursor, and other MCP-aware clients can call Grok as a tool, for the live X data alone if nothing else.

The killer integration pattern: use Claude or Codex as your coding agent (they're better at code), expose Grok via MCP as the "X data" tool, and let your agent ask Grok any time it needs to know what's happening on the platform. Best of both worlds — and yes, that means we built this page in Claude Code while letting Grok handle the "what did Elon say about Starcenter" lookups.
08 — Infrastructure · Memphis

Colossus

Grok is trained on Colossus — the cluster xAI built in a former Electrolux appliance factory on Paul R. Lowry Road in Memphis. The site went from empty floor to 100,000 H100s online in 122 days, a build pace that has never been matched in the industry. It is, as of mid-2026, the single largest contiguous AI training cluster on Earth, full stop.

GPUs (Colossus 1): ~100,000 NVIDIA H100 + ~50,000 H200 mixed into the fabric.
GPUs (Colossus 2): ~150,000 Blackwell B200/B300 in the second hall, online late 2025. Training Grok 5.
Interconnect: Spectrum-X Ethernet; non-blocking RDMA across the full fleet.
Power draw: ~250 MW continuous, planned to scale past 1 GW with the substation expansion.
Cooling: Direct-to-chip liquid loop. Closed-system, ~zero evaporative water consumption.
Storage: Hundreds of PB of NVMe scratch + an exabyte-tier cold archive for the X firehose.
Bridge to X: Dedicated dark fiber between the cluster and X's Ashburn POPs; the live data path is essentially zero-hop.

The thesis behind Colossus is the same thesis behind everything Musk does: vertical integration plus brute-force scheduling beats clever optimization. The competitors leased capacity. xAI built theirs. The 122-day build cycle was the moat, not the model.

Memphis bonus: if you've never seen the satellite photos of the Lowry Road site, find them. The energy density is genuinely surreal. The substation alone draws more juice than most American towns.
09 — Infrastructure · low Earth orbit · NEW

Starcenter — the datacenter in space

Announced May 20, 2026 (yesterday) at a joint SpaceX × xAI presser. Starcenter is the first operational compute pod in low Earth orbit. The pitch is straightforward and, in retrospect, embarrassingly obvious: a solar array in space gets sunlight 24/7/365 at roughly 1.4× the irradiance at Earth's surface, and the cooling cost is zero rubles because the universe is 3 Kelvin and very large. If you can get the silicon up there at a sane cost, the energy economics become nearly impossible to compete with.

What was announced

Why it actually makes sense

Ignore the "datacenter in space" headline; the engineering argument is more interesting than the marketing one. Electricity for a terrestrial GPU rack costs about $0.04 per kWh on a great day, $0.12 on a bad one. The capex of the rack itself is amortized over five years, and roughly 30% of its cost of ownership is electricity and another 15% is cooling. In orbit, electricity is free past the cost of the solar array; cooling is passive past the cost of the radiator; and SpaceX is the only company that gets to amortize the lift cost across an existing rocket cadence.
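For a sense of scale on the electricity line item, here is a back-of-the-envelope that applies the $0.04 to $0.12 per kWh range to the ~250 MW continuous draw quoted for Colossus earlier on this page. The helper name is made up for illustration, and the calculation assumes the draw is flat for all 8,760 hours of a non-leap year.

```python
HOURS_PER_YEAR = 24 * 365  # 8,760 hours; leap years ignored


def annual_energy_cost_usd(power_mw: int, cents_per_kwh: int) -> int:
    """Yearly electricity bill in whole dollars for a constant power draw.

    Integer cents are used so the arithmetic stays exact.
    """
    kwh_per_year = power_mw * 1_000 * HOURS_PER_YEAR  # MW -> kW, then kWh
    return kwh_per_year * cents_per_kwh // 100


good_day = annual_energy_cost_usd(250, 4)   # $0.04 per kWh
bad_day = annual_energy_cost_usd(250, 12)   # $0.12 per kWh
print(f"${good_day:,} to ${bad_day:,} per year")
# $87,600,000 to $262,800,000 per year
```

That range is the recurring cost the orbital pitch claims to replace with a one-time solar array and radiator.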

The trade is mass. You're paying somewhere between $200 and $800 per kg to LEO (Starship V3 is at the low end of that envelope when it's flown reusably). A rack of Blackwell silicon is roughly 1,500 kg. The break-even against a Memphis equivalent — including launch insurance, redundancy for radiation events, and the on-orbit servicing margin — happens at about year three at current launch prices and falls to about year two if Starship hits its target reuse cadence.
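The launch side of that trade is simple multiplication. The sketch below uses only the figures from the paragraph above (a roughly 1,500 kg rack and $200 to $800 per kg to LEO); the intermediate price points and the helper name are illustrative assumptions, and insurance, redundancy, and servicing margin are left out.

```python
RACK_MASS_KG = 1_500  # approximate mass of one Blackwell rack, per the text


def launch_cost_usd(mass_kg: int, usd_per_kg: int) -> int:
    """Cost to lift a payload to LEO at a flat per-kilogram price."""
    return mass_kg * usd_per_kg


# Sweep the quoted envelope; 400 and 600 are illustrative midpoints.
for price in (200, 400, 600, 800):
    cost = launch_cost_usd(RACK_MASS_KG, price)
    print(f"${price}/kg -> ${cost:,} per rack")
# $200/kg -> $300,000 per rack
# $400/kg -> $600,000 per rack
# $600/kg -> $900,000 per rack
# $800/kg -> $1,200,000 per rack
```

So the lift bill for one rack lands between $300,000 and $1.2 million, which is the upfront cost the break-even estimate has to recover.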

What this means for Grok users

Hedge: Starcenter is less than a week old as of this writing. Anything in this section is best understood as "what was announced and provisionally demonstrated" rather than "stably operational for years." We'll update as the constellation matures. Source of record: the May 20 SpaceX livestream and the xAI blog post the same day.