Dicta en español. Voice-to-text for Castilian, Mexican, Colombian, Argentinian, Chilean, and Caribbean Spanish on Windows. Inverted punctuation, accent marks, and Spanglish handled out of the box. Free for 500 words a day.
Regional accents, accent marks, and inverted punctuation all handled
Opening question and exclamation marks placed at the correct positions when the engine detects an interrogative or exclamatory sentence. No layout switching, no manual insertion.
Spain, Mexico, Colombia, Argentina (with voseo and Rioplatense pronunciation), Chile, Peru, Venezuela, Caribbean. The same model handles them all.
á, é, í, ó, ú, ü, and ñ produced as native characters in the transcript. Tilde-sensitive contrasts (sí versus si, más versus mas) come out correctly in most contexts.
English loanwords inside Spanish sentences (meeting, marketing, software, parking) are recognized inline. Bilingual users in the US and Mexico can dictate naturally.
Local Mode keeps audio on your Windows machine. Client conversations, patient notes, customer support chats never leave the building.
500 words a day, 3,500 a week. Pro at $10/month unlocks unlimited dictation for heavy Spanish writing workflows.
Spanish is one of the largest non-English components of the OpenAI Whisper training corpus. The model has been exposed to Spanish from Spain, Mexico, Colombia, Argentina, Chile, Peru, Venezuela, Cuba, Puerto Rico, the Dominican Republic, and the wider Caribbean, as well as US Spanish from speakers of Mexican, Cuban, and Puerto Rican descent. That breadth shows up in real-world accuracy: Spanish dictation works well across regional registers without any per-accent tuning.
StarWhisper packages Whisper for Windows as a desktop dictation app. You install the free download, set the language to Spanish in Settings, and press a global hotkey to dictate into whatever text field has focus. The transcript lands at your cursor in Outlook, Word, Gmail, Notion, Slack, Teams, WhatsApp Web, your CRM, your customer support tool, or any other Windows application.
This page is written in English because the target search query is in English (people search for "Spanish dictation software" or "voice to text for Spanish" in English when they are evaluating tools), but the actual dictation runs in Spanish at full quality. The same applies for bilingual professionals who think about software in English and write client-facing copy in Spanish.
The split between Peninsular Spanish (Spain) and Latin American Spanish is the most discussed dimension, but in practical terms Whisper handles both at the same level. The differences are lexical and phonological, not orthographic, so the transcript reflects what was said without "correcting" regional vocabulary.
Madrid, Valencia, Andalusian, and the rest of Peninsular Spanish work well. The distinción between c/z and s (theta vs. ese) is no problem because both produce the same written output. Vosotros forms come through as vosotros, not as ustedes. Spain-specific vocabulary (ordenador, coche, móvil, vale) is preserved as spoken.
Mexican Spanish is one of the most thoroughly trained variants because of the size of the Mexican media market. Both standard Mexico City Spanish and northern variants work cleanly. Mexican vocabulary (carro, computadora, celular, popote) is preserved. Mexican slang and modismos that appear in widely distributed YouTube and podcast content are recognized.
The distinctive Rioplatense pronunciation of ll and y as a sh-sound (yo as "sho", calle as "cashe") is recognized correctly by Whisper, which transcribes it back as standard Spanish orthography. Voseo (vos sos, vos tenés, vos podés) is transcribed as voseo, not corrected to tuteo. Argentinian and Uruguayan Spanish work without needing any per-country setting.
Chilean Spanish has a reputation for being one of the harder Spanish variants to transcribe because of its speed, vowel reduction, and idiomatic vocabulary. Whisper handles standard professional Chilean Spanish well; very rapid casual Chilean speech may need more cleanup than other regional varieties.
Caribbean Spanish, with its s-aspiration and consonant deletion, is one of the harder regions for any speech recognition engine. Whisper does better than older systems on Caribbean Spanish but standard speech in news-broadcast register works noticeably better than rapid colloquial conversation. Most professional dictation use cases fall in the easier range.
Spanish has punctuation conventions that English does not, and Whisper applies them correctly. The opening inverted question mark (¿) and opening inverted exclamation mark (¡) are placed at the start of interrogative and exclamatory clauses without any manual intervention. The closing marks (? and !) appear at the end as expected. Both come through without needing a Spanish keyboard layout.
Accented vowels are produced natively. á, é, í, ó, ú, and ü all appear correctly. The model knows where the orthographic accent goes for distinguishing pairs (sí versus si, más versus mas, sé versus se, té versus te, qué versus que in interrogative contexts) and gets it right in the majority of cases. The letter ñ is produced as ñ, not as n or ny.
Capitalization in Spanish differs from English in some places: months and days of the week are lowercase, nationalities are lowercase, and only the first word of a title is typically capitalized. Whisper follows these conventions. Sentence-initial capitalization and proper noun casing work as expected. For a fuller look at multi-language handling, see the multi-language feature page.
Spanish-English code-switching is the working reality for millions of bilingual professionals: US Hispanic sales reps, customer support agents serving binational markets, journalists covering Latin America for English-language outlets, Mexican consultants working on US accounts, and Spanish-speaking medical scribes in US hospitals. Whisper handles the kind of mixed speech that emerges in those settings well.
If your dictation is mostly Spanish with English loanwords (meeting, deadline, marketing, software, status, follow-up, parking, sandwich), set the StarWhisper language to Spanish and dictate normally. The engine recognizes the English tokens embedded in Spanish sentences without breaking flow. If you switch to a fully English paragraph for a few sentences in the middle of a Spanish document, accuracy on the English part is also strong.
For documents that flip between long Spanish and long English passages every paragraph or two, switching the language to Auto-detect tells the engine to pick per-segment. Brand names from English contexts (Microsoft, Salesforce, Google, LinkedIn, GitHub) keep their original casing. For bilingual long-form writing, the voice-to-text for content creators workflow applies equally to Spanish output.
US-based customer support teams serving Spanish-speaking customers can dictate responses three to four times faster than they can type. The output is clean enough for direct send after a quick review. Common support templates (greetings, escalations, troubleshooting steps, closings) come out naturally. Local Mode means the customer information stays inside your Windows environment.
Following up with bilingual prospects, writing CRM notes about Spanish-language calls, drafting Spanish-language email outreach. Set the language to Spanish, dictate the message, send. The same workflow applies for Salesforce, HubSpot, Pipedrive, Zoho, and any other CRM with a Windows desktop or web interface.
US hospitals and clinics serving Spanish-speaking populations need medical note documentation in Spanish or in English summarizing a Spanish conversation. StarWhisper handles both directions: dictate notes in Spanish for Spanish-only charts, or dictate the English summary of a Spanish-language patient encounter. Standard medical vocabulary in Spanish is recognized. For workflows beyond general dictation, see the voice-to-text for doctors page.
Spanish-language YouTube channels, podcasts, blogs, newsletters, and social media accounts. Dictate first drafts in Spanish at speech speed. Output is publishable with light editing. The unlimited Pro plan removes the daily word cap for high-volume creators.
Spanish-language legal documents, contracts, real estate listings, and client correspondence work well. Standard legal Spanish is recognized; very specialized terminology may need correction. The Local Mode privacy posture suits regulated work.
Spain has four co-official languages plus Spanish. Whisper treats them as separate languages rather than as Spanish variants, and StarWhisper exposes them through the language selector.
Catalan support is solid. The training corpus includes meaningful Catalan content from Catalan news, public broadcasting, and YouTube, so dictation in Catalan from Barcelona, Valencia (where it is called Valencian), or the Balearic Islands works well. Galician works for standard registers used in Galicia. Basque (Euskara) is harder because it is a language isolate unrelated to Romance languages, with less training data than Catalan or Galician, but it is supported.
To dictate in any of these, switch the StarWhisper language setting from Spanish to the specific language. If you write mixed-language documents (Spanish-Catalan in Barcelona offices, Spanish-Galician in Santiago de Compostela, Spanish-Basque in Bilbao), Auto-detect can handle the segmentation. For a deeper feature-level overview, see the multi-language feature page.
| Plan | Words | Price (USD) |
|---|---|---|
| Free | 500 words/day, 3,500/week | $0 |
| Pro Monthly | Unlimited | $10/month |
| Pro Annual | Unlimited | $80/year ($6.67/month) |
Billing runs through Stripe in USD. Your bank converts to EUR (Spain), MXN (Mexico), COP (Colombia), ARS (Argentina), CLP (Chile), or whatever your local currency is at the prevailing rate. There is no country-based pricing markup and no separate fee for Spanish or any other supported language. The full 96+ language pack ships in the same free installer.
The free tier is genuinely usable for personal Spanish writing: emails, journaling, WhatsApp messages, social media posts, short documents. The Pro plan makes sense for daily Spanish writing in volume: customer support agents, bilingual sales reps, medical scribes, journalists, content creators. The annual plan saves about a third compared to monthly billing. See the pricing section for the full breakdown.
StarWhisper runs on Windows 10 and Windows 11. Mac and Linux are not supported. The installer is about 100 MB; Whisper model files download on first use based on your hardware capabilities. The CPU path works on any modern Windows machine. An NVIDIA GPU with CUDA accelerates the larger models significantly. AMD and Intel discrete GPUs can use Vulkan as a cross-vendor fallback.
For Spanish dictation, the small or medium Whisper model is usually sufficient. The large model gives marginal accuracy gains on difficult speech or strong regional variants, at the cost of more VRAM and slower transcription. A decent microphone matters more than model size: a USB headset or directional desk mic produces noticeably cleaner output than laptop built-in mics. For the GPU side, see the GPU acceleration page. For installation troubleshooting, the FAQ covers common setup questions.
Other StarWhisper pages that pair well with Spanish dictation
Umlauts, compound nouns, and DACH-region dialects covered. GDPR-friendly local processing.
Kanji/hiragana/katakana auto-conversion. No IME juggling for Japanese writing.
Long-form drafting workflows for YouTube, podcast, and newsletter content in any language.
Convert Spanish, English, and bilingual WhatsApp voice notes to readable text on Windows.