Helsinki Corpus of Swahili 2.0


Currently available versions of this resource

ShortnameName and metadataLicenseLocationCiteResource group and helpApplyPublication yearSupport level
ShortnameName and metadataLicenseLocationCiteResource group and helpApplyPublication yearSupport level

Resource information

Helsinki Corpus of Swahili 2.0 is available for research purposes in Kielipankki – the Language Bank of Finland. The corpus contains about 25 million words of written text, and it is available in two formats. The annotated version contains morphological and syntactic annotation as well as glosses in English. The not annotated version contains plain text. The corpus text was randomly shuffled document-internally. The sentence order is the same in both corpus versions.

For more information on the corpus please see: https://www.kielipankki.fi/corpora/hcs2/

License and access

  • Some versions of this resource are available publicly (PUB), whereas others might require you to log in as an academic user (ACA) or to apply for individual access rights (RES).
  • Click on the license image to see the resource-specific license text.
  • Some/all versions of this resource may contain personal data (license condition +PRIV). The license may then include additional data protection terms and conditions that you must follow. If processing personal data, maintain a public Privacy Notice regarding your project and provide the link to the Language Bank of Finland, see instructions.
  • Of this language corpus different versions/subcorpora are published in the Language Bank of Finland. The versions are available through the Language Bank Download Service and/or through the Korp concordance tool. The links to the different versions can be found from the list above.
  • Some versions of this resource are available in the computing environment (see column ’Location’). icon-question-circle

Detailed information on the content of each version, user rights and licenses can be found from it’s specific metadata record.


This resource group page has a Persistent Identifier: http://urn.fi/urn:nbn:fi:lb-2014032624

Last modified on 2025-03-03