Batch TTS Processing

Convert hundreds of texts to speech at once. Enter rows manually or upload a CSV/XLSX spreadsheet to generate audio for every line.

# Text Voice Filename
1
2
3

Drag and drop your CSV file here, or browse

CSV with columns: text, voice (optional), filename (optional)
file.csv
# Text Voice Filename
Example CSV format:
textvoicefilename
Welcome to our serviceaf_heartwelcome
Your order has shippedorder_shipped
Thank you for callingbf_emmathanks

Drag and drop your Excel file here, or browse

XLSX with columns: text, voice (optional), filename (optional)
file.xlsx
# Text Voice Filename
Example XLSX format:
textvoicefilename
Welcome to our serviceaf_heartwelcome
Your order has shippedorder_shipped
Thank you for callingbf_emmathanks
0 items ready to generate 0 total characters
Free with Kokoro model
Processing may take several minutes for large batches
Processing 0 / 0... 0%
Batch Results
# Filename Status Actions

Batch Settings

Used when a row does not specify a voice

Credits

Estimated total cost 0 credits
Current balance Not logged in
Need more credits?

Batch Processing Tips

  • Use CSV for 100+ items for the fastest workflow
  • Each row generates one audio file
  • Max 500 items per batch
  • Processing time depends on total text length
  • Leave the voice column empty to use the default voice
  • Downloaded ZIP preserves your naming pattern

How Batch TTS Processing Works

Generate audio for hundreds of texts in three simple steps. No scripting or API knowledge required.

Step 1

Add Your Texts

Enter texts manually one row at a time, or upload a CSV or XLSX spreadsheet. Each row becomes one audio file. Optionally specify a voice and filename per row for full control over every output.

Step 2

Configure Settings

Choose a default AI model and voice that applies to all rows without a specific voice. Select your output format (MP3 or WAV) and set a naming pattern: use the filename column, sequential numbers, or a custom prefix.

Step 3

Generate & Download ZIP

Click Generate All and watch real-time progress as each row is processed. Play or download individual results, or grab everything at once as a single ZIP archive. Failed items can be retried individually.

Batch TTS Use Cases

Batch processing saves hours of repetitive work across education, telephony, e-commerce, and content localization.

E-Learning Modules

Generate narration for entire courses at once. Upload a spreadsheet of lesson scripts and produce consistent, professional audio across all modules without recording a single word manually.

IVR Prompts

Create all your phone system prompts in one batch. Menu options, hold messages, after-hours greetings, and department transfers -- generate hundreds of prompts with a consistent voice and tone.

Product Descriptions

Turn your product catalog into audio descriptions for accessibility, smart speakers, or in-store kiosks. Export your product CSV and generate one audio file per item with matching filenames.

Localization

Translate your UI strings or content into multiple languages, then batch-generate audio for each locale. Perfect for apps, games, and websites that need localized voice prompts or narration.

Audio Catalogs

Build searchable audio libraries from structured data. Museums, libraries, and archives can convert metadata, descriptions, or transcripts into audio guides with batch processing.

Notification Sounds

Generate a full set of notification, alert, and confirmation voice prompts for your app or device. Upload a list of short messages and download a ZIP of ready-to-integrate audio files.

Batch Processing Features

Everything you need to process large volumes of text-to-speech efficiently and reliably.

CSV & XLSX Import

Upload a spreadsheet with your texts and the system parses it automatically. The first column is the text to convert, and optional columns let you specify a voice and filename per row. Supports standard CSV with comma or semicolon delimiters, as well as Excel XLSX files. Headers are auto-detected so column order does not matter.

Per-Row Voice Control

Assign a different voice to each row for maximum flexibility. Use the voice column in your spreadsheet or the dropdown in manual entry mode. Rows without a specified voice automatically use the default voice from your batch settings. Mix and match voices within a single batch to create varied, natural-sounding audio sets.

Progress Tracking

Watch a real-time progress bar and per-row status updates as your batch is processed. Each row transitions through queued, processing, completed, and failed states. The results table updates live so you can start previewing completed audio files while the rest of the batch is still generating. Failed items display error details and can be retried.

ZIP Export

Download all generated audio files as a single ZIP archive with one click. Files are named according to your chosen naming pattern: original filenames from the spreadsheet, sequential numbers (001.mp3, 002.mp3), or a custom prefix plus number. The ZIP preserves the order of your input rows for easy mapping back to your source data.

Frequently Asked Questions

Batch processing lets you convert many texts to speech at once. Upload a CSV or XLSX spreadsheet with your texts, choose voices and settings, and generate audio for every row. Download all results as a ZIP file.

Your CSV should have at minimum a "text" column. Optional columns: "voice" (voice ID), "filename" (output filename), "model" (TTS model). If voice/model are not specified per row, the default settings are used.

Up to 500 items per batch. For larger batches, split your CSV into multiple files. Each item is processed independently, so a failure on one row does not affect others.

Yes. Add a "voice" column to your CSV with the voice ID for each row. Rows without a voice specified will use the default voice from the batch settings panel.

Processing time depends on total text length and the model used. Kokoro processes approximately 100 rows (500 characters each) in about 2 minutes. Slower models like Tortoise will take longer.

Yes. The interface shows a progress bar and per-row status (queued, processing, completed, failed). You can see results appear in real-time and start downloading completed files before the batch finishes.

Choose MP3 or WAV as the output format. All files in the batch use the same format. Download individual files or all results as a single ZIP archive.

By default, files are numbered sequentially (001.mp3, 002.mp3, ...). You can specify custom filenames via the "filename" CSV column, or set a prefix pattern in the settings.

Yes. Submit individual TTS requests to our API in parallel. Each returns a UUID for polling. For convenience, the web batch tool handles the orchestration, parallelism, and ZIP packaging for you.

E-learning course narration (one row per slide), IVR phone system prompts, product description audio, app notification sounds, multilingual content localization, and accessibility audio for large content libraries.

Same credit rates as individual TTS. Free-tier models (Kokoro, Piper) cost 0 credits. Standard models cost 2 credits per 1,000 characters. The batch tool shows estimated total cost before you start.

Yes. After a batch completes, failed rows are marked with an error message. You can retry failed items individually or filter to re-process all failures with one click.
5.0/5 (1)

Process Hundreds of Texts at Once

Upload a spreadsheet and generate audio for every row. Perfect for e-learning, IVR, and localization.