Openai Whisper Api

🌐Community
by steipete · vlatest · Repository

Helps with OpenAI, API as part of building AI and machine learning applications workflows.

Install on your platform

We auto-selected Claude Code based on this skill’s supported platforms.

1

Run in terminal (recommended)

terminal
claude mcp add steipete-openai-whisper-api npx -- -y @trustedskills/steipete-openai-whisper-api
2

Or manually add to ~/.claude/settings.json

~/.claude/settings.json
{
  "mcpServers": {
    "steipete-openai-whisper-api": {
      "command": "npx",
      "args": [
        "-y",
        "@trustedskills/steipete-openai-whisper-api"
      ]
    }
  }
}

Requires Claude Code (claude CLI). Run claude --version to verify your install.

About This Skill

The OpenAI Whisper API skill enables AI agents to transcribe audio files into text using OpenAI's robust speech-to-text models. It supports multiple languages and handles various audio formats, making it ideal for processing voice notes, interviews, or recorded meetings directly within an agent workflow.

When to use it

  • Convert spoken language from podcasts or lectures into searchable transcripts for analysis.
  • Extract text from customer support call recordings to populate ticket systems automatically.
  • Transcribe multilingual audio content without needing separate models for each language.
  • Process short voice memos or dictation tasks where high accuracy is required.

Key capabilities

  • Access to OpenAI's Whisper model variants (base, small, medium, large).
  • Support for over 100 languages and dialects.
  • Automatic detection of the spoken language in the audio file.
  • Handling of various audio formats including MP3, WAV, and more.

Example prompts

  • "Transcribe this uploaded audio file into text using the Whisper model."
  • "Detect the language spoken in this recording and provide a full transcript."
  • "Extract all speaker segments from this meeting recording and label them by timestamp."

Tips & gotchas

Ensure your audio files are reasonably clean; excessive background noise may reduce transcription accuracy. Larger models like 'large-v3' offer better precision but require more computational resources and time to process compared to smaller variants.

Tags

🛡️

TrustedSkills Verification

Unlike other registries that point to live repositories, TrustedSkills pins every skill to a verified commit hash. This protects you from malicious updates — what you install today is exactly what was reviewed and verified.

Security Audits

Gen Agent Trust HubPass
SocketPass
SnykPass

Details

Version
vlatest
License
Author
steipete
Installs
117

🌐 Community

Passed automated security scans.