Newsletter of the Language Bank of Finland


Researchers of the Month in 2022

  1. Jussi Ylikoski – Finno-Ugric languages, grammar, etymology
  2. Tuisku Vilenius – online discussions related to the Saami people
  3. Ari Huhta – language assessment, foreign language learning
  4. Terhi Ainiala – urban place names, digital discourses
  5. Mika Hämäläinen – computational creativity, language technology for endangered languages
  6. Jack Rueter – morpho-syntactic description of minority languages

All previous researchers of the month can be found in the archive.

Do you know researchers who use the Language Bank of Finland and who might be good candidates for Researcher of the Month? Could you be one of them? Let us know: https://www.kielipankki.fi/support/contact-us/

New corpora in 2022

Updated or extended corpora in 2022

Would you like to offer your own resource to be distributed via Kielipankki?

Submit the basic details about your own resource to the Language Bank of Finland: http://urn.fi/urn:nbn:fi:lb-2021121421

General-purpose HeLI-OTS language identifier released through industry-academia cooperation

HeLI-OTS is a general-purpose language identifier that can automatically detect the language used in a text. This ELG-compatible tool selects the most suitable option from a list of 200 languages. HeLI-OTS has been developed as part of a collaborative project on text and speech recognition between the University of Helsinki and Lingsoft, funded by the Finnish Research Impact Foundation. Read more

Major Korp update

Korp has been updated to version 9. In addition to bug fixes, the new Korp has some new features, although some of them will be activated only once the corpora support them. Please report any bugs or shortcomings in the new Korp, as well as feature requests, either via the feedback form or by email to fin-clarin (at) helsinki.fi.

The Donate Speech campaign data available via Kielipankki in autumn 2022

The Donate Speech campaign (Lahjoita puhetta) is still running. Of the 4000 hours of Finnish speech that have been donated so far, 1500 hours have been manually transcribed. The donated speech material will be made available for restricted research and development purposes via the Language Bank of Finland in autumn 2022.

LUMI supercomputer in Kajaani, hosted by CSC, is now in action

LUMI is owned by the EuroHPC Joint Undertaking, and it is run by a consortium of 10 countries with long traditions and knowledge of scientific computing. LUMI is an ecosystem for high-performance computing, artificial intelligence, and data-intensive research, which enables breakthroughs in several branches of academic research. In addition, a fifth of LUMI's capacity is reserved for companies. Read more

COST Action "NexusLinguarum": Virtual Mobility Grants to support research activities and networking in a virtual setting

Within the COST Action "NexusLinguarum", centered around linguistic data science, a new call for Virtual Mobility Grants (VMGs) has been issued with a collection date of 30 June. VMGs are a networking tool launched by the COST Association. They aim to help individual participants foster collaborative research activities, network with other researchers, and exchange knowledge in a virtual setting. Moreover, you can still become a member of one of the Working Groups within the Action. Read more

Apply for CLARIN funding

Did you know that CLARIN offers grants for, e.g., researcher and teacher mobility, events and training activities? Check out the funding opportunities and current calls: https://www.clarin.eu/funding

FIN-CLARIAH infrastructure introduced its goals in a poster exhibition

FIN-CLARIAH, the national research infrastructure for Social Sciences and Humanities in Finland, received funding from the Academy of Finland for the years 2022-23. FIN-CLARIAH consists of two components, FIN-CLARIN and DARIAH-FI. We organized a kick-off event where posters introduced the infrastructure's goals and the work it will carry out. See the posters here.

Read more about FIN-CLARIN: FIN-CLARIN
Read more about DARIAH-FI: DARIAH-FI
For the roadmap of FIN-CLARIAH, see also: FIN-CLARIAH

The Language Bank of Finland wishes you a relaxing summer!

Mietta Lennes
Project Planning Officer
fin-clarin@helsinki.fi

 


Subscribe/unsubscribe to this newsletter: https://www.kielipankki.fi/language-bank/newsletter-subscription/

See also the CLARIN Newsflash: https://www.clarin.eu/content/newsflash

 


 

Newsletter of the Language Bank of Finland

Researchers of the Month in 2021

  1. Mats Fridlund – research related to digital history
  2. Emmi Lahti – rhetoric and discourse studies
  3. Heikki Rasilo – speech production and its learning mechanisms
  4. Gwenaëlle Bauvois – research related to right-wing populism, countermedia, reinformation, hybrid media and post-truth
  5. Mila Oiva – cultural history
  6. Karita Suomalainen – interactional linguistics
  7. Olli Kuparinen – variation and change in spoken Finnish
  8. Okko Räsänen – computational modeling of infant language development
  9. Juho Leinonen – automatic speech recognition, speech alignment and chatbots
  10. Veronika Laippala – large language resources and computational methods
  11. Mikko Kurimo – automatic speech recognition
  12. Jutta Salminen – expressing negation in Finnish

All previous researchers of the month can be found in the archive.

Do you know researchers who use the Language Bank of Finland and who might be good candidates for Researcher of the Month? Could you be one of them? Let us know: https://www.kielipankki.fi/support/contact-us/

Updates to resource-specific licenses and data protection terms and conditions

The resource-specific license terms and conditions will be updated in the near future. The most prominent change is that resource-specific data protection terms and conditions will be included in the licenses of those resources that contain personal data. Information about the license updates will be published on the Language Bank website. Read more about what to expect: https://www.kielipankki.fi/news/updates-to-resource-specific-licenses-and-data-protection-terms-and-conditions/

New corpora in 2021

The corpora that were previously available via the LAT platform (discontinued in 2020) have been moved to the download service. The content of the downloadable corpora is essentially the same as in LAT, and the samples can be studied with, e.g., Praat or ELAN. At a later stage, we intend to make some speech corpora accessible via Korp as well. The current status and access location of each corpus can be found in its metadata record and on the page of the resource group in question.

Would you like to offer your own resource to be distributed via Kielipankki?

Submit the basic details about your own resource to the Language Bank of Finland: http://urn.fi/urn:nbn:fi:lb-2021121421

New: Resource group pages

A given resource may be available as several different versions or variants that are provided for different purposes. The new resource group pages provide an overview of all the available versions. Read more

Korp will be updated soon

The official Korp update has been postponed until January. However, many of the new features and improvements can already be tested in Korplab. Your feedback is welcome! Read more

New Aalto-ASR module for automatic speech recognition and for aligning text with speech

The upgraded Aalto-ASR 2.1 is available for testing in the Puhti environment at CSC. If required, it is also possible to install the system in a local environment from a Docker container. Read more

Courses and awarded training materials

The online course Corpus Linguistics and Statistical Methods (5 ECTS) will be offered again in Jan-Mar 2021 and it can be taken either in Finnish or in English. The course is open to all universities and you can also participate in it from outside Finland. Course details

Apply for CLARIN funding

Did you know that CLARIN offers grants for, e.g., researcher and teacher mobility, events and training activities? Check out the funding opportunities and current calls: https://www.clarin.eu/funding

The Donate Speech campaign continues – Finnish and Finland-Swedish can be donated in parallel campaigns

The Donate Speech campaign (Lahjoita puhetta) is still running. You may now donate your speech in Swedish, too! Of the 4000 hours of Finnish speech that have been donated so far, 1500 hours have been manually transcribed. Starting from spring 2022, the donated speech material will be made available for restricted research and development purposes via the Language Bank of Finland.

Kielipankki – The Language Bank of Finland, Yle (Finnish Broadcasting Company) and the Donate Speech campaign received three awards in 2021

The new FIN-CLARIAH infrastructure receives funding from the Academy of Finland

FIN-CLARIAH, the national research infrastructure for Social Sciences and Humanities in Finland, was granted 4.6 M€ by the Academy of Finland for the years 2022-23. FIN-CLARIAH consists of two components, FIN-CLARIN and DARIAH-FI.

Read more about FIN-CLARIN: FIN-CLARIN
Read more about DARIAH-FI: DARIAH-FI
For the roadmap of FIN-CLARIAH, see also: FIN-CLARIAH

The Language Bank of Finland wishes you a nice and relaxing Christmas time!

Mietta Lennes
Project Planning Officer
fin-clarin@helsinki.fi

 


Subscribe/unsubscribe to this newsletter: https://www.kielipankki.fi/language-bank/newsletter-subscription/

See also the CLARIN Newsflash: https://www.clarin.eu/content/newsflash

 


 


Contact information

Technical maintenance of the Language Bank:
kielipankki (at) csc.fi
tel. 09 4572001

Matters related to corpora and other content:
fin-clarin (at) helsinki.fi
tel. 029 4129317

More detailed contact information