Identifying Basque dialects

Sun, 28 Jun 2026 00:00:00 +0000

TL;DR

I’ve created an experiment called NongoEuskara. To do this, I gathered data, trained a fastText classification/embedding model, and built this little demo website. Using WebAssembly, these models run directly in the browser.

The long explanation

We know that large language models (LLMs) are capable of doing pretty much everything. Even though they weren’t originally designed for it, today they can program, translate, transcribe audio, summarize long texts, and a long etcetera. Sometimes, however, the best solution isn’t always using an LLM. For example, for quick and simple translations, there are lighter neural translators that require far fewer resources than LLMs.

Fasttext on Xabi Ezpeleta

Identifying Basque dialects

TL;DR

The long explanation