Identifying Basque dialects

TL;DR I’ve created an experiment called NongoEuskara. To do this, I gathered data, trained a fastText classification/embedding model, and built this little demo website. Using WebAssembly, these models run directly in the browser. The long explanation We know that large language models (LLMs) are capable of doing pretty much everything. Even though they weren’t originally designed for it, today they can program, translate, transcribe audio, summarize long texts, and a long etcetera. Sometimes, however, the best solution isn’t always using an LLM. For example, for quick and simple translations, there are lighter neural translators that require far fewer resources than LLMs. ...

June 28, 2026