In English it is pretty good. But talk to it in Polish, and suddenly it thinks y...

loire280 · 2026-02-04T21:07:34 1770239254

They don't claim to support Polish, but they do support Russian.

> The model is natively multilingual, achieving strong transcription performance in 13 languages, including English, Chinese, Hindi, Spanish, Arabic, French, Portuguese, Russian, German, Japanese, Korean, Italian, and Dutch. With a 4B parameter footprint, it runs efficiently on edge devices, ensuring privacy and security for sensitive deployments.

I wonder how much having languages with the same roots (e.g. the romance languages in the list above or multiple Slavic languages) affects the parameter count and the training set. Do you need more training data to differentiate between multiple similar languages? How would swapping, for example, Hindi (fairly distinct from the other 12 supported languages) for Ukrainian and Polish (both share some roots with Russian) affect the parameter count?

MarcelOlsz · 2026-02-04T21:38:25 1770241105

Nobody ever supports Polish. It's the worst. They'll support like, ̵Swahili, but not Polish.

edit: I stand corrected lol. I'll go with "Gaelic" instead.

chickenimprint · 2026-02-04T21:58:31 1770242311

Swahili is subcontinental lingua franca spoken by 200M people and growing quickly. Polish is spoken by a shrinking population in one country where English is understood anyways.

viraptor · 2026-02-05T09:21:25 1770283285

> where English is understood anyways.

It's popular. But not that popular - you couldn't assume a random person over 30yo on the street would be able to have a chat.

londons_explore · 2026-02-04T21:56:40 1770242200

200 million people speak Swahili.

39 million people speak Polish, and most of those also speak English or another more common language.

timhh · 2026-02-04T22:10:17 1770243017

You could say the same about Dutch to be fair. 90-95% speak English - I bet that's way higher than in Poland.

gerad · 2026-02-04T23:59:21 1770249561

As an American, my perspective is that Dutch people speak better English than a large percentage of English people and Americans.

RestartKernel · 2026-02-05T13:27:48 1770298068

As a Dutch person, I'm very doubtful that's the case, but I'm willing to bet a good ESL speaker is more aware of common grammatical errors than some native speakers. For example, the your/you're mixup makes no sense if you've had to explicitly learn about English contractions in the first place.

vkazanov · 2026-02-05T06:19:55 1770272395

Heh, based on my incorrect and probably wrong experience Dutch and Swedes are the best non-native english speakers in term of both the accent and fluency.

viraptor · 2026-02-05T09:28:15 1770283695

Those and Icelandic people. But there's a fun correlation - see how much the US media content is played compared to local one per country. And which countries use subs rather than dubs or voiceovers in cinemas and TV. https://publications.europa.eu/resource/cellar/e4d5cbf4-a839...

If you have exposure to English media from young age and don't get a translation, you learn pretty quickly.

_ache_ · 2026-02-05T00:52:42 1770252762

Just a side note to remember that this is a mini model. It's very small and yet 12 languages.

I guess a European version can be created but now it's aimed at a world wide distribution.

sbinnee · 2026-02-05T08:11:41 1770279101

I guess I will check Korean. OpenAI audio mini is not bad but I always have to make gpt to check and fix transcription.

lm28469 · 2026-02-04T19:30:27 1770233427

> The model is natively multilingual, achieving strong transcription performance in 13 languages, including English, Chinese, Hindi, Spanish, Arabic, French, Portuguese, Russian, German, Japanese, Korean, Italian, and Dutch.

Try sticking to the supported languages

tdb7893 · 2026-02-04T18:37:21 1770230241

Yeah, it's too bad. Apparently it only performs well in certain languages: "The model is natively multilingual, achieving strong transcription performance in 13 languages, including English, Chinese, Hindi, Spanish, Arabic, French, Portuguese, Russian, German, Japanese, Korean, Italian, and Dutch"

ricardonunez · 2026-02-04T19:37:10 1770233830

It did great English and Spanish, it didn't switch to Portuguese, french nor German, maybe struggle with my accent.

scotty79 · 2026-02-04T20:11:22 1770235882

Try to warn it you are going to switch language to Portugese. Worked for me.

yko · 2026-02-04T19:13:07 1770232387

That's a mix of Polish and Ukrainian in the transcript. Now, if I try speaking Ukrainian, I'm getting transcript in Russian every time. That's upsetting.

overfeed · 2026-02-04T20:09:43 1770235783

Oh no! The model won't translate to an unsupported language, and incorrectly reverts to one that it was explicitly trained on.

The base likely was pretrained on days that included Polish and Ukrainian. You shouldn't be surprised to learn it doesn't perform great on languages it wasn't trained on, or perhaps had the highest share of training data.

scotty79 · 2026-02-04T20:12:09 1770235929

Tell it you are going to speak Polish now. It helps.

Cthulhu_ · 2026-02-05T08:59:31 1770281971

Cracking non-English or accented / mispronounced English is the white whale of text-to-speech I think; I don't know about you, but in our day to day chats there's a lot of jargon, randomly inserted English words, etc. And when they speak in English it's often what I call expat-English which is what you get when non-native speakers only speak the language with other non-native speakers.

Add poor microphone quality (using a laptop to broadcast a presentation to a room audience isn't very good) and you get a perfect storm of untranscribeable presentations or meetings.

All I want from e.g. Teams is a good transcript and, more importantly, a clever summary. Because when you think about it, imagine all the words spoken in a meeting and write them down - that's pages and pages of content that nobody would want to read in full.

moffkalast · 2026-02-04T22:29:13 1770244153

I'm not sure why but their multilingual performance in general has usually been below average. For a French company, their models are not even close to being best in French, even outdone by the likes of Qwen. I don't think they're focusing on anything but English, the rest is just marketing.

mystifyingpoi · 2026-02-04T18:36:10 1770230170

TBH ChatGPT does the same, when I mix Polish and English. Generally getting some cyrillic characters and it gets super confused.

DaedalusII · 2026-02-04T23:38:14 1770248294

polish logically should be rendered in cyrillic as the cyrillic orthography more closely matches the sounds and consonant structure of slavic languages like polish and russian, although this has never been done for church reasons . maybe this is confusing ai

iagooar · 2026-02-04T23:46:41 1770248801

Polish has been written with Latin alphabet since the 13th century. And before it simply wasn't written.

Polish works with the Latin alphabet just fine.

"Do kraju tego, gdzie kruszynę chleba podnoszą z ziemi przez uszanowanie dla darów Nieba.... Tęskno mi, Panie..."

"Mimozami jesień się zaczyna, złotawa, krucha i miła. To ty, to ty jesteś ta dziewczyna, która do mnie na ulicę wychodziła."

viraptor · 2026-02-05T09:36:06 1770284166

> although this has never been done for church reasons

That's not the case. Polish uses Latin-like alphabet due to Czech influence and German printers.