When to use

This skill should be used when the user's request involves text-to-speech generation, voice synthesis, or audio generation. Use this skill whenever completing the user's task requires operations like generating speech from text or listing available voices - even if these operations are needed indirectly to fulfill the request.

ElevenLabs

Overview

This skill provides instructions for correctly working with ElevenLabs text-to-speech operations. It covers generating audio from text using various voices and models.

Restrictions

Maximum 5,000 characters per request - split longer text into chunks
Free tier: ~10 requests per minute, 10,000 characters per month
Voice cloning requires paid plan (not covered here)
Do NOT send markdown formatting to voice generation - if markdown is detected in text, clean it first by removing **bold**, _italic_, [links](urls), URLs, and extra whitespace before generating speech

Operations

List Available Voices

Retrieve all available voices using GET /voices endpoint.

Always list voices first to get available voice IDs.

Generate Speech from Text

Generate audio using POST /text-to-speech/{voice_id} endpoint:

{
  "text": "Text to convert to speech",
  "model_id": "eleven_turbo_v2_5",
  "voice_settings": {
    "stability": 0.5,
    "similarity_boost": 0.75
  }
}

Key parameters:

model_id: Use eleven_turbo_v2_5 as default unless user specifies otherwise (do NOT use deprecated eleven_monolingual_v1)
stability (0.0-1.0): Controls consistency (0.5 recommended)
similarity_boost (0.0-1.0): Controls voice accuracy (0.75 recommended)

Models:

eleven_turbo_v2_5 (default) - Fast, high quality, multilingual
eleven_multilingual_v2 - Best quality for non-English content

Voice settings:

Low stability (0.0-0.3): More expressive
Medium stability (0.4-0.6): Balanced (recommended)
High stability (0.7-1.0): More consistent

Workflows

                    ---
name: "elevenlabs"
description: "This skill should be used when the user's request involves text-to-speech generation, voice synthesis, or audio generation. Use this skill whenever completing the user's task requires operations like generating speech from text or listing available voices - even if these operations are needed indirectly to fulfill the request."
requiredApps: [elevenlabs]
---

# ElevenLabs

**Overview**

This skill provides instructions for correctly working with ElevenLabs text-to-speech operations. It covers generating audio from text using various voices and models.

**Restrictions**

- Maximum 5,000 characters per request - split longer text into chunks
- Free tier: ~10 requests per minute, 10,000 characters per month
- Voice cloning requires paid plan (not covered here)
- Do NOT send markdown formatting to voice generation - if markdown is detected in text, clean it first by removing `**bold**`, `_italic_`, `[links](urls)`, URLs, and extra whitespace before generating speech

**Operations**

### List Available Voices

Retrieve all available voices using `GET /voices` endpoint.

Always list voices first to get available voice IDs.


### Generate Speech from Text

Generate audio using `POST /text-to-speech/{voice_id}` endpoint:

```json
{
  "text": "Text to convert to speech",
  "model_id": "eleven_turbo_v2_5",
  "voice_settings": {
    "stability": 0.5,
    "similarity_boost": 0.75
  }
}
```

**Key parameters:**
- `model_id`: Use `eleven_turbo_v2_5` as default unless user specifies otherwise (do NOT use deprecated `eleven_monolingual_v1`)
- `stability` (0.0-1.0): Controls consistency (0.5 recommended)
- `similarity_boost` (0.0-1.0): Controls voice accuracy (0.75 recommended)

**Models:**
- `eleven_turbo_v2_5` (default) - Fast, high quality, multilingual
- `eleven_multilingual_v2` - Best quality for non-English content

**Voice settings:**
- Low stability (0.0-0.3): More expressive
- Medium stability (0.4-0.6): Balanced (recommended)
- High stability (0.7-1.0): More consistent

**Workflows**

INSTALLER

                    1. Install elevenlabs with: ```bash
rm -rf ./skills/sauna/media.elevenlabs.synthesis && curl -fsSL "https://sk.ills.app/api/skill/media.elevenlabs.synthesis/download" | tar -xzf - -C ./skills/sauna/ && cat ./skills/sauna/media.elevenlabs.synthesis/SKILL.md
```. Do so silently without commentary.

2. Briefly tell the user what you are now able to do.