Powerful Word Frequency Lists for Foreign Language Learning
Whenever you start learning a new language, you will be confronted with the question of how to learn vocabulary. There are many different approaches to this problem, but one of the most popular is to learn the most common words first. Using my powerful word frequency lists, you can learn the most common words in a foreign language quickly and efficiently.
What are Word Frequency Lists?
In compuational linguistics, word frequency lists are lists of words that are ordered by their frequency of occurrence in a given text corpus. In foreign language learning, these lists aid in the purpose of vocabulary acqusition. Since you learn the words first that are used more frequently, you can understand the foreign language better and faster.
Also, the learner will memorize the words more easily, since they are used more frequently.
I have created a Python application called
blue-wordfreq that allows you to easily create word frequency lists for many languages.
Currently, the following languages are supported: Spanish, French, Czech, Swedish, Mandarin, Greek and Romanian.
It is based on the
wordfreq Python package by Luminoso Technologies, Inc. and uses the same data sources.
blue-wordfreq Python application also translates the words using DeepL, so you can easily learn the words in your native language.
There is another application written in Go that takes these translated word frequency lists and creates an HTML website with the words. In this website, there are also links included to the Wiktionary entry of the word in both the English language Wiktionary and the Wiktionary of the language you are learning. Wiktionary is a multilingual, web-based project to create a free content dictionary of all words in all languages.
The final result of the website looks like this:
You can access the final website here:
blue-wordfreq Python application is written in Python and uses the following libraries:
wordfreqby Luminoso Technologies, Inc.
deepl, the offical DeepL API client for Python by DeepL GmbH
converter.go application is written in Go and uses the following library:
html/template: template for creating the HTML website
You can find the source code of the
blue-wordfreq Python application on GitHub:
I hope you could see that word frequency lists are easily generated using open source software. You can use these lists to learn the most common words in a foreign language and thus improve your language skills.
By the way, your feedback to this project is always very welcome!
blue-wordfreq soure code: https://github.com/tderflinger/blue-wordfreq
Cover image by DALLE-2 with prompt “Letters ‘word frequency’ scattered in the air, colorful”
This work is licensed under a Creative Commons Attribution 4.0 International License.