docs

>

Best for: Engineers and operators who need facts from docs fast without hunting through pages.

Engineering / planning-thinkingatomicfor-engineerslight-setupfrom-file

Skill file

Preview skill file
---
name: docs
description: >
  Find the right Deepgram documentation for any task. Use whenever someone needs help locating
  docs, understanding which API to use, or wants to ask questions about Deepgram. Covers all
  product areas: speech-to-text, text-to-speech, voice agents, audio intelligence, and
  self-hosted deployments.
---

# Deepgram Documentation

Find the right docs for what you're building with Deepgram.

## Ask AI

Have a question? Get answers from Deepgram's AI assistant at <https://developers.deepgram.com/ask-ai>.

## Documentation by Topic

### Speech-to-Text (STT)

Transcribe audio and video into text. Deepgram ships two actively maintained, next-gen model families — pick the one that matches your use case.

- **Nova** (`/v1/listen`) — general-purpose transcription (captions, subtitles, batch files, live streams). Rich feature set including intelligence overlays (diarize, summarize, sentiment, topics, intents).
- **Flux** (`/v2/listen`) — conversational-audio transcription for voice agents and interactive assistants. Built-in turn-taking (EOT events, mid-session reconfig).

Docs:
- [STT Getting Started (Nova)](https://developers.deepgram.com/docs/stt/getting-started)
- [Flux Quickstart](https://developers.deepgram.com/docs/flux/quickstart)
- [Nova 3 → Flux migration](https://developers.deepgram.com/docs/flux/nova-3-migration)
- [Flux language prompting](https://developers.deepgram.com/docs/flux/language-prompting)

### Text-to-Speech (TTS)

Convert text into natural-sounding speech.

- [Text-to-Speech Docs](https://developers.deepgram.com/docs/tts-rest)

### Voice Agent

Build conversational voice agents powered by Deepgram.

- [Voice Agent Docs](https://developers.deepgram.com/docs/voice-agent)

### Text and Audio Intelligence

Analyze text and audio for sentiment, topics, intents, summaries, and more.

- [Audio Intelligence Docs](https://developers.deepgram.com/docs/audio-intelligence)

### Self-Hosted Deployments

Run Deepgram on your own infrastructure.

- [Self-Hosted Introduction](https://developers.deepgram.com/docs/self-hosted-introduction)

### API Reference

Full reference for all Deepgram REST and WebSocket APIs.

- [API Reference](https://developers.deepgram.com/reference/deepgram-api-overview)

## SDK-Specific Skills

For language-idiomatic code patterns (imports, async idioms, error handling, type shapes), install the Deepgram SDK's own skills. Every Deepgram SDK publishes 7 product skills plus a maintainer skill:

```bash
npx skills add deepgram/deepgram-python-sdk     # Python
npx skills add deepgram/deepgram-js-sdk         # JavaScript / TypeScript
npx skills add deepgram/deepgram-java-sdk       # Java
npx skills add deepgram/deepgram-go-sdk         # Go
npx skills add deepgram/deepgram-rust-sdk       # Rust
npx skills add deepgram/deepgram-swift-sdk      # Swift
npx skills add deepgram/deepgram-kotlin-sdk     # Kotlin
npx skills add deepgram/deepgram-dotnet-sdk     # C# / .NET
npx skills add deepgram/deepgram-browser-sdk    # Browser TypeScript
```

## Related Deepgram skills

- `api` — consolidated REST + WebSocket API reference
- `recipes` — minimal runnable feature snippets per language
- `examples` — full integration examples with third-party platforms
- `starters` — runnable starter apps (framework × feature)
- `setup-mcp` — Deepgram MCP server installation

## MCP Server

For direct documentation querying from your AI coding tool, use `/deepgram:mcp` to set up the Deepgram MCP server.

Source

Creator's repository · deepgram/skills

View on GitHub

Security

Security checks in progress
Results will appear here once audits complete
Checked by 3 independent security firms
Does it try to trick the AI?Not yet checkedPending · Gen Agent Trust Hub
Does it sneak in hidden code?Not yet checkedPending · Socket
Does it have known bugs?Not yet checkedPending · Snyk