choose a language or topic »|Saturday, March 23, 2019
You are here: Home » Editors Picks » Language Families
  • Follow Us!

Language Families 


   What is a language family?

Most languages belong to language families. A language family is a group of related languages that developed from a common historic ancestor, referred to as protolanguage (proto– means ‘early’ in Greek). The ancestral language is usually not known directly, but it is possible to discover many of its features by applying the comparative method that can demonstrate the family status of many languages. Sometimes a protolanguage can be identified with a historically known language. Thus, provincial dialects of Vulgar Latin are known to have given rise to the modern Romance languages, so the *Proto-Romance language is more or less identical to Latin. Similarly, Old Norse was the ancestor of NorwegianSwedishDanish and Icelandic. Sanskrit was the protolanguage of many of the languages of the Indian subcontinent, such as BengaliHindiMarathi, and Urdu. Further back in time, all these ancestral languages descended, in turn, from one common ancestor. We call this ancestor *Proto-Indo-European (PIE). Language families can be subdivided into smaller units called branches. For instance, the Indo-European family has several branches, among them, Germanic, Romance, and Slavic.

   How do linguists establish relationships among languages?

Sometimes it is relatively easy to establish relationships among languages. Let us look at the Romance languages. We know that Italian is a descendant of Latin, a language that was spoken in Italy two thousand years ago, and one which left a great number of written documents. The Roman conquest helped spread Latin throughout Europe where it eventually developed into regional dialects. When the Roman Empire broke up, these regional dialects evolved into the modern Romance languages that we know today: FrenchItalianPortugueseSpanish, and others. These languages form the Romance branch of the Indo-European language family. By looking at the word for ‘water’ in three Romance languages, one can easily see the similarities among them.

Italian acqua
Spanish agua
Portuguese agua
   What if the ancestral language left no records?

The case with Romance languages is unusually easy because their common ancestor — Latin — left many written documents. In most cases, however, the ancestral language was not written. As a result, linguists look at similarities among its modern descendants to establish common origins. Take a look at these examples:


It is clear that the word for ‘water’ looks very similar within each group, but not quite as similar across groups. Languages in the first group belong to the Germanic branch of the Indo-European language family. Languages in the second group belong to the Slavic branch. Although there are no written records of the ancestral *Proto-Germanic or *Proto-Slavic languages, we have to assume that these two ancestral languages must have existed at some time, just like Latin did.

   Where do these mystery languages belong?

Here is the word for ‘water’ in two more languages. Do you think these languages belong to any of the branches above?

Latvian udens
Albanian uje
Basque ur

As it turns out, Latvian belongs to the Baltic branch of the Indo-European language familyAlbanian has no close relatives and does not belong to any of the branches of the Indo-European language family, and Basque does not belong to any language family at all. In fact, it is a language isolate, i.e., a language that cannot be reliably assigned to any established language family.

Read “Exploratorium Magazine Lecture on the Evolution of Language

   What if there are no records, and we know little about the languages?

In many parts of the world, there are no written records, and we don’t know enough about the languages themselves. Consequently, we have to resort to grouping languages on the basis of geography. This is the case with many of the aboriginal languages of Australia, the native Indian languages of the Americas, the tribal languages of Africa, and countless other languages all over the world.

   How many language families are there?

According to Ethnologue (16th edition), there are 147 language families in the world. This figure may not be precise because of our limited knowledge about many of the languages spoken in the most linguistically diverse areas of the world such as Africa. The actual number of families, once these languages are studied and relationships among them are established, will undoubtedly keep changing.

   World’s largest language families

The largest language families (those with over 25 languages) are listed below (Ethnologue). There are 6,523 languages in this group, and together they account for close to 95 percent of all world languages (assuming that there are some 6,900 languages in the world). The remaining families account for only 5 percent of the world languages. In addition, there are 53 languages considered unclassified.

Language families Number of languages Where the languages are spoken
Angola, Benin, Botswana, Burkina Faso, Burundi, Cameroon, Central African Republic, Chad, Comoros, Congo, Cote d’Ivoire, Democratic Republic of the Congo, Equatorial Guinea, Gabon, Gambia, Ghana, Guinea, Guinea-Bissau, Kenya, Lesotho, Liberia, Malawi, Mali, Mozambique, Namibia, Niger, Nigeria, Rwanda, Senegal, Sierra Leone, Somalia, South Africa, Sudan, Swaziland, Tanzania, Togo, Uganda, Zambia, Zimbabwe
Brunei, Cambodia, China, Cook Islands, East Timor, Fiji, French Polynesia, Guam, Indonesia, Kiribati, Madagascar, Malaysia, Marshall Islands, Mayotte, Micronesia, Myanmar, Nauru, New Caledonia, New Zealand, Niue, Northern Mariana Islands, Palau, Papua New Guinea, Philippines, Samoa, Solomon Islands, Suriname, Taiwan, Thailand, Tokelau, Tonga, Tuvalu, USA, Vanuatu, Viet Nam, Wallis and Futuna
Australia, East Timor, Indonesia, Papua New Guinea
Bangladesh, Bhutan, China, India, Kyrgyzstan, Laos, Myanmar, Nepal, Pakistan, Thailand, Viet Nam
Afghanistan, Albania, Armenia, Austria, Azerbaijan, Bangladesh, Belarus, Belgium, Bosnia and Herzegovina, Bulgaria, Canada, China, Croatia, Czech Republic, Denmark, Fiji, Finland, France, Georgia, Germany, Greece, Iceland, India, Iran, Iraq, Ireland, Israel, Italy, Latvia, Lithuania, Luxembourg, Macedonia, Maldives, Nepal, Netherlands, Norway, Oman, Pakistan, Peru, Poland, Portugal, Romania, Russia, Serbia and Montenegro, Slovakia, Slovenia, South Africa, Spain, Sri Lanka, Suriname, Sweden, Switzerland, Tajikistan, Turkey, USA, Ukraine, United Kingdom, Venezuela
Afghanistan, Algeria, Bahrain, Cameroon, Chad, Cyprus, Egypt, Eritrea, Ethiopia, Georgia, Iran, Iraq, Israel, Jordan, Kenya, Libya, Mali, Malta, Mauritania, Morocco, Niger, Nigeria, Oman, Saudi Arabia, Somalia, Sudan, Syria, Tanzania, Tunisia, Turkey, United Arab Emirates, Uzbekistan, Yemen
Algeria, Benin, Central African Republic, Chad, Democratic Republic of the Congo, Egypt, Eritrea, Ethiopia, Kenya, Mali, Niger, Nigeria, South Sudan, Sudan, Tanzania, Uganda
Algeria, Benin, Burkina Faso, Central African Republic, Chad, Democratic Republic of the Congo, Eritrea, Ethiopia, Kenya, Mali, Niger, Nigeria, Sudan, Tanzania, Uganda
China, India, Laos, Myanmar, Thailand, Viet Nam
India, Nepal, Pakistan
Antigua and Barbuda, Australia, Bahamas, Barbados, Belize, Brazil, Cameroon, Cape Verde Islands, Central African Republic, Chad, Colombia, Congo, Curacao, Democratic Republic of the Congo, East Timor, Equatorial Guinea, French Guiana, Ghana, Grenada, Guadeloupe, Guinea-Bissau, Guyana, Haiti, India, Indonesia, Jamaica, Macao, Malaysia, Mauritius, Mexico, Micronesia, New Caledonia, Nicaragua, Nigeria, Norfolk Island, Panama, Papua New Guinea, Philippines, Réunion, Saint Lucia, Saint Vincent and the Grenadines, São Tomé e Príncipe, Seychelles, Sierra Leone, Singapore, Solomon Islands, South Africa, South Sudan, Sri Lanka, Suriname, Tanzania, Trinidad and Tobago, Turks and Caicos Islands, U.S. Virgin Islands, Uganda, United States, Vanuatu
Bolivia, Brazil, French Guiana, Paraguay, Peru
Bolivia, Brazil, Canada, Chile, Colombia, Ecuador, India, Indonesia, Japan, Mali, Mexico, Nepal, Nigeria, Pakistan, Papua New Guinea, Peru, Russian Federation, South Korea, Spain, United States, Venezuela
Belize, Guatemala, Mexico
Afghanistan, Azerbaijan, China, Georgia, Iran, Kazakhstan, Kyrgyzstan, Lithuania, Moldova, Mongolia, Russia, Turkey, Turkmenistan, Uzbekistan
El Salvador, Mexico, United States
Bahamas, Bolivia, Brazil, Colombia, Guyana, Honduras, Peru, Suriname, Venezuela
Papua New Guinea
Papua New Guinea
Argentina, Bolivia, Chile, Colombia, Ecuador, Peru
Canada, United States
Canada, United States
China, Laos, Vietnam
Estonia, Finland, Hungary, Latvia, Norway, Russia, Sweden
Azerbaijan, Georgia, Russian Federation
Canada, United States
Bolivia, Brazil
Papua New Guinea
Brazil, Colombia, Guyana, Suriname, Venezuela
Bolivia, Brazil, Peru
Angola, Botswana, Namibia, South Africa, Tanzania
Canada, United States
Brazil, Colombia, Ecuador

*Quechua is also classified as a macrolanguage, i.e., a family of varieties of a single language that are not distinct enough to be considered separate languages.


20 Responses to Language Families

  1. Curious

    I read here, “According to Ethnologue, there are 218 language families in the world,” yet I’m looking at the Ethnologue website at this very moment and read, “There are 136 language families.” Why did you make up that large number? Does anyone here do facct checking? Is this a fluke, or can I trust anything at your website?

    Just curious…

    • Irene Thompson

      We did not make it up. Our information was based on an earlier version of Ethnologue (version 14) which had that number. We are in the process of updating many of our pages, as you can see from above.

  2. Seemless

    your statement that “Albanian has no close relatives and does not belong to any of the branches of the Indo-European language family.”(sic!) is highly misleading!!!

    Albanian IS an Indo-European language, but it is in a branch of Indo-European all by itself!

    (It used to be considered as belonging to the Illyrian branch.) But your phrasing makes it appear that Albanian is not a branch of / does not berlong to the Indo-European family of languages at all!

    Linguistic affinities [from wikipedia’s article on the Albanian language]:

    The Albanian language is an Indo-European language in a branch by itself, sharing its branch with no other extant language. (The other extant Indo-European languages in a branch by themselves are Armenian and, in some classifications, Greek.)

    Though sharing lexical isoglosses with Greek, Balto-Slavic, and Germanic, the vocabulary of Albanian is quite distinct. Once hastily grouped with Germanic and Balto-Slavic based on the merger of PIE *ǒ and *ǎ into *ǎ in a supposed “northern group”,[8] Albanian has been proven to be distinct from these two because this vowel shift is only part of a larger push chain that affected all long vowels.[9]

    Albanian does share two features with Balto-Slavic languages: a lengthening of syllabic consonants before voiced obstruents and a distinct treatment of long syllables ending in a sonorant.[10] Conservative features of Albanian include the retention of the distinction between active and middle voice, present tense, and aorist.

    Albanian is considered to have evolved from an ancient Paleo-Balkan language, usually taken to be either Illyrian or Thracian, but this is disputed. (See also Thraco-Illyrian and Messapian language.)

    • Irene Thompson

      You are misreading the statement your statement that “Albanian has no close relatives and does not belong to any of the branches of the Indo-European language family.”(sic!) is highly misleading!!! It never implies that Albanian is not considered to be an IE language. But thank you for your comment.

  3. seemless

    Not to put too fine of a poin on it, but I repeat, you should simply modify your article to read:

    “The Albanian language is an Indo-European language in a branch all by itself”

    rather than the present wording:

    “The Albanian language … does not belong to any of the branches of the Indo-European language family,”

    because what you’ve written, i.e., if it does NOT belong to ANY of the branches of the Indo-European language family, then the meaning of that statement, in plain English, is that it does not belong to the Indo-European language family AT ALL!

    But otherwise, thank you very much for your efforts in providing this otherwise very helpful web site!


    I’m thanking Mr. Thompson that he has give us a brief idea about language families.
    I’m a scool student. As my knowledhge is so brief about linguistics I criticise your article.But after reading the comments I think you should modify the ‘Albanian’ part.The terms that you have used, is not suitable for it.Again I’m thanking you for your this work.

  5. Prodeep Kumar Daimary

    I was searching about the linguistic families of the world. But I did not come across the exact name list of the principal linguistic families clearly.

  6. Pingback: What makes languages different? Part Two: Signals and Understanding | Dreamers and Doers

    • Irene Thompson

      Relationships among languages are considered from a historical perspective precisely for the purpose of establishing their genealogy, much the same as a family tree. This in no way provides a metric for mutual comprehensibility in either the spoken or the written form. Your example of Hungarian and Finnish (and, for that matter, Estonian) means that the ancestral language has evolved along different paths.

  7. Tommaso Lumare

    In the section “How do linguists establish relationships among languages?” the right italian word for water is “acqua” and not “aqua”.

  8. social media

    It’s hard to come by educated people in this particular
    topic, but you seem like you know what you’re talking about!


  9. gia xe toyota land cruiser prado 2017

    My brother recommended I would possibly like this blog. He used to be entirely right.
    This post truly made my day. You can not imagine just how
    much time I had spent for this information! Thank you!

  10. toyota tan cang

    Fine way of explaining, and good post to take facts about my
    presentation subject matter, which i am going to present in school.

  11. heriberto navarro

    beautifull try harder, this is so helpful for all the people who enjoy searching for the meaning of our history as a human race and our way to communicate with each other, actually that is the secret of our success as species …so interesting, so clear an invitation to my curiosity..!!

  12. Austin Chang

    Please include Hawaiian in the Austronesian family!!
    The Hawaiian language has had enough erasure in history, even when most of the world knows what Hawai’i is.

    • Irene Thompson

      We did not mean to exclude Hawaiian. We included only languages with a million or more speakers. There are many other sites that list it. See Ethnologue, Wikipedia, and many other language sites, including our entry on Hawaiian.


Add a Comment