AI Chat — DevUtilities

§ 01 Capabilities

Nine quiet things
it does well.

i

Markdown that breathes

Rebuilt on the Textual rendering engine. Headings, tables, nested lists and blockquotes are typeset, not just displayed. Streaming updates land without layout flicker.

ii

One-click code copy

Every fenced block carries its own clipboard button. Grab a command, a config, or a snippet without touching your prompt history.

iii

OpenAI Responses API

Full SSE streaming with reasoning output. Compatible with GPT-5-pro and any model that requires the new endpoint. Toggled per-model via Use Responses API.

iv

DeepSeek deepthink

Models stream their chain of thought into a collapsible section above the answer. Read it, ignore it, fold it away — your call.

v

Bring your own model

Any OpenAI-compatible endpoint. Custom headers, custom auth, self-hosted URLs. Built-in list reconciles with current defaults each launch.

vi

Persistent sessions

Multiple chats, each with its own model and history. Pick up the conversation that was interrupted last Tuesday, not a fresh prompt every time.

vii

Stop, without loss

Hit ⎋ mid-stream and the partial response is preserved exactly as it landed. The streaming indicator goes away; your context doesn't.

viii

Image generation

Generate diagrams and visual references through the Responses API, refined turn by turn — the conversation context comes with you.

ix

Web search, when asked

Optional real-time search for fresh documentation and current best practice. Off by default; opt in per session, never silently.

§ 02 A typical session

Four moves from prompt to paste.

Pick a model

Choose a built-in (GPT-5, DeepSeek), or add your own with endpoint, headers and key.

Settings · ⌘ , · AI · Models

Ask the question

Type, paste a stack trace, or drop in a code block. Reasoning models stream their thinking first, in a panel you can fold.

Send · ↩

Take the code

Every fenced block has its own copy button. The last block is one shortcut away — no selection, no scroll-back.

Last block · ⌘ ⇧ C

Keep going

The thread persists across launches with its model and history intact. Start a new session for a new line of work; the old one is still on disk.

New chat · ⌘ N

§ 03 Made for these tasks

The work it actually does.
Not generic chat — developer chat.

i — Review

Code reads & second opinions

Paste a function, ask for a critique. The Markdown renderer keeps diffs, lists, and fenced suggestions intact instead of mashing them into prose.

ii — Debug

Stack traces, line by line

Drop the error, attach the relevant block, walk the cause. Reasoning models surface their hypothesis path before the fix lands.

iii — Architecture

Design conversations

Trade-offs, patterns, the boring questions that decide a system. Persistent sessions mean tomorrow's thread can pick up today's context.

iv — Learn

Concept explanations

From a half-understood paper to working examples. Fold the deepthink panel away once the answer clicks; reopen it later if you need to retrace.

v — Optimise

Profile-shaped questions

Bring numbers, get specific suggestions. The assistant doesn't guess at hot paths it can't see — it asks for the data, then proposes.

vi — Secure

Threat models & review

Walk an auth flow, audit a snippet, reason about a CVE. Web search is opt-in for when documentation needs to be current, not stale.

§ 04 Questions, answered

Things people ask before they download.

What's new in v2.12?

Three threads. The Markdown renderer migrated to Textual — better code highlighting, no streaming flicker, accurate tables. Every code block carries a copy button. And the OpenAI Responses API is supported for GPT-5-pro and reasoning models, with full SSE streaming.

Which models are supported?

Built-ins include GPT-5, DeepSeek v4-flash and v4-pro, and GPT-5-pro via the Responses API. Beyond that — any OpenAI-compatible endpoint. Add a model under Settings → AI → Models → Add Model: endpoint, key, model name, optional custom headers.

How does the copy button work?

Every fenced code block in the response renders with a copy button in its top-right corner. Click it; the entire snippet is on your clipboard. ⌘ ⇧ C grabs the most recent block from anywhere in the chat.

What is the OpenAI Responses API?

OpenAI's newer endpoint, required for GPT-5-pro and any model that exposes reasoning output. Toggle it per model with Use Responses API in the model settings — DevUtilities handles the SSE stream, including the thinking section.

How does DeepSeek reasoning surface in the UI?

DeepSeek v4-pro streams its chain of thought as a collapsible panel above the final answer. The panel opens by default; collapse it from the brain icon. The same panel is used for any Responses-API model that exposes thinking output.

Is conversation history saved?

Yes. Each chat session is persisted on disk, with its own model selection and history. Sessions survive launches, updates, and the rest. Delete a session and its history goes with it.

How do I stop a streaming response?

The stop button cancels the request immediately. Anything the model has already produced is preserved as-is — the streaming indicator is removed and the partial response stays in the thread.

Where do my API keys live?

In macOS UserDefaults on your machine. Keys never reach our servers — there are no servers. The chat goes directly from DevUtilities to the endpoint you configured. See the privacy policy for the full picture.

§ 05 Companion tools

AI Chat, rewritten for builders.

Nine quiet things
it does well.

Markdown that breathes

One-click code copy

OpenAI Responses API

DeepSeek deepthink

Bring your own model

Persistent sessions

Stop, without loss

Image generation

Web search, when asked

Four moves from prompt to paste.

Pick a model

Ask the question

Take the code

Keep going

The work it actually does.
Not generic chat — developer chat.

Code reads & second opinions

Stack traces, line by line

Design conversations

Concept explanations

Profile-shaped questions

Threat models & review

Things people ask before they download.

Better with the rest of the shelf.

One assistant, twenty-three tools, one quiet window.

AI Chat, rewritten for builders.

Nine quiet things it does well.

Markdown that breathes

One-click code copy

OpenAI Responses API

DeepSeek deepthink

Bring your own model

Persistent sessions

Stop, without loss

Image generation

Web search, when asked

Four moves from prompt to paste.

Pick a model

Ask the question

Take the code

Keep going

The work it actually does. Not generic chat — developer chat.

Code reads & second opinions

Stack traces, line by line

Design conversations

Concept explanations

Profile-shaped questions

Threat models & review

Things people ask before they download.

Better with the rest of the shelf.

AI Translate

JSON Formatter

HTTP Client

Text Compare

JWT

Random String

One assistant, twenty-three tools, one quiet window.

Nine quiet things
it does well.

The work it actually does.
Not generic chat — developer chat.