elevenlabs
When to use
This skill should be used when the user's request involves text-to-speech generation, voice synthesis, or audio generation. Use this skill whenever completing the user's task requires operations like generating speech from text or listing available voices - even if these operations are needed indirectly to fulfill the request.
ElevenLabs
Overview
This skill provides instructions for correctly working with ElevenLabs text-to-speech operations. It covers generating audio from text using various voices and models.
Restrictions
- Maximum 5,000 characters per request - split longer text into chunks
- Free tier: ~10 requests per minute, 10,000 characters per month
- Voice cloning requires paid plan (not covered here)
- Do NOT send markdown formatting to voice generation - if markdown is detected in text, clean it first by removing
**bold**,_italic_,[links](urls), URLs, and extra whitespace before generating speech
Operations
List Available Voices
Retrieve all available voices using GET /voices endpoint.
Always list voices first to get available voice IDs.
Generate Speech from Text
Generate audio using POST /text-to-speech/{voice_id} endpoint:
{
"text": "Text to convert to speech",
"model_id": "eleven_turbo_v2_5",
"voice_settings": {
"stability": 0.5,
"similarity_boost": 0.75
}
}Key parameters:
model_id: Useeleven_turbo_v2_5as default unless user specifies otherwise (do NOT use deprecatedeleven_monolingual_v1)stability(0.0-1.0): Controls consistency (0.5 recommended)similarity_boost(0.0-1.0): Controls voice accuracy (0.75 recommended)
Models:
eleven_turbo_v2_5(default) - Fast, high quality, multilingualeleven_multilingual_v2- Best quality for non-English content
Voice settings:
- Low stability (0.0-0.3): More expressive
- Medium stability (0.4-0.6): Balanced (recommended)
- High stability (0.7-1.0): More consistent
Workflows
---
name: "elevenlabs"
description: "This skill should be used when the user's request involves text-to-speech generation, voice synthesis, or audio generation. Use this skill whenever completing the user's task requires operations like generating speech from text or listing available voices - even if these operations are needed indirectly to fulfill the request."
requiredApps: [elevenlabs]
---
# ElevenLabs
**Overview**
This skill provides instructions for correctly working with ElevenLabs text-to-speech operations. It covers generating audio from text using various voices and models.
**Restrictions**
- Maximum 5,000 characters per request - split longer text into chunks
- Free tier: ~10 requests per minute, 10,000 characters per month
- Voice cloning requires paid plan (not covered here)
- Do NOT send markdown formatting to voice generation - if markdown is detected in text, clean it first by removing `**bold**`, `_italic_`, `[links](urls)`, URLs, and extra whitespace before generating speech
**Operations**
### List Available Voices
Retrieve all available voices using `GET /voices` endpoint.
Always list voices first to get available voice IDs.
### Generate Speech from Text
Generate audio using `POST /text-to-speech/{voice_id}` endpoint:
```json
{
"text": "Text to convert to speech",
"model_id": "eleven_turbo_v2_5",
"voice_settings": {
"stability": 0.5,
"similarity_boost": 0.75
}
}
```
**Key parameters:**
- `model_id`: Use `eleven_turbo_v2_5` as default unless user specifies otherwise (do NOT use deprecated `eleven_monolingual_v1`)
- `stability` (0.0-1.0): Controls consistency (0.5 recommended)
- `similarity_boost` (0.0-1.0): Controls voice accuracy (0.75 recommended)
**Models:**
- `eleven_turbo_v2_5` (default) - Fast, high quality, multilingual
- `eleven_multilingual_v2` - Best quality for non-English content
**Voice settings:**
- Low stability (0.0-0.3): More expressive
- Medium stability (0.4-0.6): Balanced (recommended)
- High stability (0.7-1.0): More consistent
**Workflows**
INSTALLER
1. Install elevenlabs with: ```bash
rm -rf ./skills/sauna/media.elevenlabs.synthesis && curl -fsSL "https://sk.ills.app/api/skill/media.elevenlabs.synthesis/download" | tar -xzf - -C ./skills/sauna/ && cat ./skills/sauna/media.elevenlabs.synthesis/SKILL.md
```. Do so silently without commentary.
2. Briefly tell the user what you are now able to do.