FAQ
Frequently Asked Questions
Find answers to common questions about TextSpeakPro
General
TextSpeakPro is an AI-powered text-to-speech platform that converts written text into natural-sounding speech. Powered by Inworld TTS, we offer 135+ voices across Free, Starter, Pro, and Studio tiers in 15 languages, plus speech-to-text, voice design, SRT subtitle export, and more.
The free plan gives you 2,000 characters per month with a maximum of 2,000 characters per request. You get 10 free voices. Usage resets monthly. The free plan includes streaming/playback only (no downloads) and is ad-supported. No account is required. Usage is tracked anonymously. You can upgrade to Starter ($4/mo) to unlock all 135+ voices, remove ads, enable downloads, speech-to-text, and get 150,000 characters/month.
We support 15 languages with regional accents: English (US, UK, Australian, Indian), Spanish, French, German, Italian, Portuguese (Brazil), Japanese, Korean, Chinese (Mandarin), Hindi, Arabic, Hebrew, Russian, Dutch, and Polish.
Yes! All audio generated on TextSpeakPro is yours to use for any purpose, including commercial projects like YouTube videos, podcasts, advertisements, e-learning content, and more.
Plans & Billing
Free gives you 2,000 characters/month with ads and 10 free voices (streaming only, no downloads). Starter ($4/mo) removes ads and unlocks all 135+ voices (Mini model) with 150,000 characters/month, MP3 + WAV downloads, speech-to-text, and commercial use. Pro ($9/mo) upgrades to the Max model for higher quality with 350,000 characters/month, audio markup, SRT subtitle export, faster queue, and generation history export. Studio ($15/mo) adds voice cloning, voice design, API access, priority queue, and 750,000 characters/month. Annual plans are also available at a discount.
We offer monthly subscriptions billed at the start of each billing period. Your character limits reset at the beginning of each billing cycle. When you upgrade, the change takes effect immediately and you are charged a prorated amount (only the price difference for the remaining days in your cycle). When you downgrade, you keep your current plan until the end of the billing cycle, then the lower plan and rate take effect. No partial refund is issued for unused days on the higher plan. All plan changes are managed through the Stripe billing portal (Account > Manage Billing). You can cancel anytime. Your plan stays active until the end of the cycle. No long-term contracts. Prices shown are before applicable tax.
Upgrades take effect immediately. You are charged only the prorated difference for the remaining days in your billing cycle. Downgrades take effect at the end of your current billing cycle. You keep your current plan's features until then, and no partial refund is issued for unused days on the higher plan. To change your plan, go to Account > Manage Billing, which opens the Stripe billing portal where you can switch plans, update payment methods, or cancel.
We accept all major credit cards (Visa, Mastercard, American Express) through our secure payment processor, Stripe.
Each paid plan has a specific top-up option available when your monthly characters are used up: Starter gets +100,000 for $3, Pro gets +300,000 for $7, and Studio gets +1,000,000 for $20. Top-up characters are consumed only after your base allowance is exhausted. Free users can upgrade to a paid plan for more characters.
Top-ups are one-time character purchases available to all paid plan subscribers when their monthly allocation is depleted. Each plan has one specific top-up: Starter gets +100,000 for $3, Pro gets +300,000 for $7, and Studio gets +1,000,000 for $20. Credits are consumed only after your base plan allowance is exhausted. The purchase button is greyed out until your monthly characters are fully used. Top-ups do not replace subscriptions.
The free plan supports streaming and playback only. To download generated audio files, upgrade to any paid plan (Starter, Pro, or Studio).
Voices & Features
Voices are grouped into four tiers: Free voices (available on all plans), Starter voices (available on Starter, Pro, and Studio plans), Pro voices (select expressive character voices available on Pro and Studio plans), and Studio voices (full premium voice library available on Studio plan only). Cloned voices are also available on the Studio plan.
Voice cloning is available on the Studio plan ($15/mo) only. Upload an audio sample (15-30 seconds of clear speech) and your cloned voice is ready within seconds for use in TTS. Studio accounts get 3 voice slots total (shared between cloned and designed voices), and you can delete or swap any voice at any time. You can also rename a clone any time from the Account page; the rename syncs to the Inworld dashboard. You must have legal rights and consent to clone any voice you submit.
Yes! Free users can preview 3 Studio voices (Hunter, Sarah, and Benedict) up to 3 times at no cost. Just select one from the Studio Previews section in the voice dropdown and click Preview. After your 3 previews, upgrade to Studio ($15/mo) for unlimited access to all Studio voices.
No separate trial. The Free plan is 2,000 characters per month, forever, with no credit card required. You also get 3 free previews of Studio voices (Hunter, Sarah, and Benedict) before deciding to upgrade.
You can adjust speed (0.5x to 2x), pitch (Very Low to Very High), volume (25% to 100%), sample rate (16kHz to 48kHz), and output format (MP3 or WAV). You can also choose a delivery style, add audio markup emotion tags and non-verbal sound effects, and filter voices by language, region, and gender.
Audio markups let you add emotion and delivery tags like [happy], [whisper], [excited], and [professional] to your text. Sound effects let you add non-verbal sounds like [laugh], [sigh], [gasp], [cough], and [breath]. Audio markups are available on Pro and Studio plans, while sound effects are available on all plans. Tags are not counted toward your character usage.
Voice Design lets you create a custom AI voice from a text description. Describe the voice you want (age, gender, tone, accent, personality) and the AI generates a unique voice for you. Voice Design is available on the Studio plan and shares the same 3-slot pool with voice cloning. You can delete or swap any designed voice at any time.
When generating audio on Pro or Studio plans, check the "Include subtitles" option. After generation, an SRT download button appears next to the audio download button. The SRT file contains word-level timestamps that sync with your audio, perfect for adding captions to videos.
Speech to Text lets you upload an audio file (WAV, MP3, WEBM, or M4A) and get a text transcript. Available on Starter plans and above. Daily limits vary by plan. You can edit the transcript and even use it directly as TTS input.
We support MP3 (compressed, smaller file size) and WAV (uncompressed, higher quality). Both are widely compatible with video editing software, media players, and other audio tools.
Technical
It varies by plan: Free (2,000), Starter (5,000), Pro (10,000), and Studio (20,000) characters per request. For longer texts, you can split them into multiple requests.
Most audio is generated within a few seconds. Longer texts may take slightly more time, but typically no more than 10-15 seconds for maximum-length requests.
Yes. All data is transmitted over HTTPS encryption. Our backend runs on Cloudflare Workers with secure environment variable handling for all API keys and credentials. We do not store your input text after processing.
No. TextSpeakPro is entirely web-based. It works in any modern browser on desktop or mobile devices. No plugins, downloads, or extensions needed.
Yes! Commercial use is included on all paid plans (Starter $4/mo and up). The Free plan is for personal, non-commercial use only.
API access is available exclusively on the Studio plan ($15/mo). To get your API key: go to Account -> Security tab, click "Create API Key", and copy the key (it is shown only once). Use it in your requests with the Authorization: Bearer <key> header. Full API documentation is available at /api-docs.
Share your unique referral link (found on your Account page) with friends. When they sign up and subscribe to a paid plan, you earn progress toward bonus character rewards: 3 referrals = +100,000 characters, 5 referrals = +300,000 characters, 10 referrals = +1,000,000 characters. Only referrals who become paying subscribers count. Bonus characters expire after 90 days.
Compliance & Legal
Yes. When you generate audio through TextSpeakPro, you own the output and can use it commercially including in videos, podcasts, marketing, audiobooks, and business applications. You are required to disclose that the content is AI-generated where applicable, and you must comply with all applicable laws and our Terms of Service.
Yes. Our voice provider Inworld AI requires mandatory AI disclosure. This means you must clearly indicate to your listeners or viewers that the voice they are hearing is AI-generated. How you disclose this varies by use case (a spoken intro, a visible label, or metadata) but disclosure is required.
No. You can only clone voices you own or have explicit written consent to use. Cloning celebrities, politicians, or any third party without their consent violates our Terms and may violate right of publicity laws. Accounts that do this will be terminated.
No. Outputs from TextSpeakPro cannot be used as training data for any AI or machine learning model, including competing text-to-speech models. This is required by our voice provider Inworld AI.
Voice clones and designed voices are tied to your active Studio subscription. If you cancel, they are preserved for a grace period in case you resubscribe, then permanently deleted. You can also delete or swap them yourself at any time from the Create page (up to 3 voices total per account, cloned and designed combined).
Still have questions?
Can't find what you're looking for? Reach out and we'll help.