gemini-audio
Guide for implementing Google Gemini API audio capabilities - analyze audio with transcription, summarization, and understanding (up to 9.5 hours), plus generate speech with controllable TTS. Use when processing audio files, creating transcripts, analyzing speech/music/sounds, or generating natural speech from text.
Packaged view
This page reorganizes the original catalog entry around fit, installability, and workflow context first. The original raw source lives below.
Install command
npx @skill-hub/cli install kienhaminh-speed-reader-gemini-audio
Repository
Skill path: .claude/skills/gemini-audio
Guide for implementing Google Gemini API audio capabilities - analyze audio with transcription, summarization, and understanding (up to 9.5 hours), plus generate speech with controllable TTS. Use when processing audio files, creating transcripts, analyzing speech/music/sounds, or generating natural speech from text.
Open repositoryBest for
Primary workflow: Ship Full Stack.
Technical facets: Full Stack, Backend.
Target audience: Development teams looking for install-ready agent workflows..
License: Unknown.
Original source
Catalog source: SkillHub Club.
Repository owner: kienhaminh.
This is still a mirrored public skill entry. Review the repository before installing into production workflows.
What it helps with
- Install gemini-audio into Claude Code, Codex CLI, Gemini CLI, or OpenCode workflows
- Review https://github.com/kienhaminh/speed-reader before adding gemini-audio to shared team environments
- Use gemini-audio for development workflows
Works across
Favorites: 0.
Sub-skills: 0.
Aggregator: No.