Ha Language Corpus (ha-corpus)

Suomeksi


Currently available versions of this resource

ShortnameName and metadataLicenseLocationCiteResource group and helpApplyPublication yearSupport level
ShortnameName and metadataLicenseLocationCiteResource group and helpApplyPublication yearSupport level

Upcoming versions of this resource

These resource versions are not yet available in the Language Bank of Finland.

ShortnameName and metadataLicenseFormatsSupport levelContact PersonResource group and helpLocationOther information
ShortnameName and metadataLicenseFormatsSupport levelContact PersonResource group and helpLocationOther information

Resource information

This corpus of spoken Ha language consists of transcripts of elicited types of natural speech (stories and elicited sentences) collected in the towns of Kibondo, Kasulu and Kigoma and nearby regions in Tanzania during the years 1997, 2000 and 2003. The original transcripts have been pseudonymized.
Ha language (ISO 639-3: haq; Great Lakes Bantu language JD66; alternative names Igiha, Giha, Kiha) is spoken in Western Tanzania in the Kigoma region. It is closely related to, for example, Rundi of Burundi and Kinyarwanda of Rwanda. Ha is one of biggest languages in Tanzania with approximately 1,2 million speakers.

The collection and analysis of the corpus data is described in the following publication:
Harjula, Lotta 2004. The Ha Language of Tanzania: Grammar, Texts, and Vocabulary. East African Languages and Dialects 13. Cologne: Köppe. ISBN 978-3-89645-027-2.

License and access

  • The versions of this resource are available publicly (PUB).
  • Click on the license image to see the resource-specific license text.

 

 


This page has a persistent identifier: http://urn.fi/urn:nbn:fi:lb-2026042401

Last modified on 2026-04-27