Penyusunan Korpus Bahasa Daerah

Slide presentation for an invited online talk (in Indonesian) at the workshop on creating regional language corpora (24 October 2024, 16.00 Jakarta time). The workshop was organised by the Language and Literature Preservation working group of the National Language Agency of...

Bibliographic Details
Main Author: Gede Primahadi Wijaya Rajeg (14592023) (author)
Published: 2024
Subjects:
_version_ 1852025074917310464
author Gede Primahadi Wijaya Rajeg (14592023)
author_facet Gede Primahadi Wijaya Rajeg (14592023)
author_role author
dc.creator.none.fl_str_mv Gede Primahadi Wijaya Rajeg (14592023)
dc.date.none.fl_str_mv 2024-11-20T14:39:31Z
dc.identifier.none.fl_str_mv 10.25446/oxford.27867759.v1
dc.relation.none.fl_str_mv https://figshare.com/articles/presentation/Penyusunan_Korpus_Bahasa_Daerah/27867759
dc.rights.none.fl_str_mv CC BY-NC-SA 4.0
info:eu-repo/semantics/openAccess
dc.subject.none.fl_str_mv Indigenous data and data technologies
Data management and data science not elsewhere classified
Indonesian languages
Language studies not elsewhere classified
Corpus linguistics
Language documentation and description
Linguistics not elsewhere classified
linguistik korpus kuantitatif
quantitative corpus linguistics
linguistik korpus
Corpus Linguistics
language documentation
corpus creation
dokumentasi bahasa
pembangunan korpus bahasa
Indonesian languages
Indonesian regional languages
Endangered Language Documentation
bahasa daerah
regional language corpora
regional language infrastructure
Enggano language
Balinese language
historical corpora
lexical databases
dc.title.none.fl_str_mv Penyusunan Korpus Bahasa Daerah
dc.type.none.fl_str_mv Text
Presentation
info:eu-repo/semantics/publishedVersion
text
description <p dir="ltr">Slide presentation for an invited online talk (in Indonesian) at the workshop on creating regional language corpora (24 October 2024, 16.00 Jakarta time). The workshop was organised by the Language and Literature Preservation working group of the National Language Agency of Indonesia. The talk began by introducing the concept of a corpus and the <a href="https://www.go-fair.org/fair-principles/" rel="noreferrer" target="_blank">FAIR</a> data principles as a way to widen access to the created corpora, especially for native speakers of the language, in addition to researchers and academics. Next, one of the case studies presented was the creation of the Contemporary Enggano language corpora and the Enggano legacy materials (texts and word lists), including the expected outputs, among them digital and print Enggano dictionaries. In addition, the creation of cross-linguistic parallel corpora from picture stimuli (the <a href="https://scopicproject.wordpress.com/" rel="noreferrer" target="_blank">SCOPIC project</a>), especially the Balinese corpora, was discussed. Then, as a model for infrastructure, the <a href="https://www.ldaca.edu.au/" rel="noreferrer" target="_blank">language data across Australia</a> project was presented. Finally, the talk mentioned the community-based development of digital infrastructure centred on the <a href="https://dictionary.basabali.org/Main_Page" rel="noreferrer" target="_blank">Balinese language dictionary and content</a>.</p><p dir="ltr">Rajeg, Gede Primahadi Wijaya (2024). Penyusunan Korpus Bahasa Daerah. University of Oxford. Presentation. <a href="https://doi.org/10.25446/oxford.27867759" rel="noreferrer" target="_blank">https://doi.org/10.25446/oxford.27867759</a></p>
eu_rights_str_mv openAccess
id Manara_2c18248e98a6f2ee90fdd8bfec3e7bdf
identifier_str_mv 10.25446/oxford.27867759.v1
network_acronym_str Manara
network_name_str ManaraRepo
oai_identifier_str oai:figshare.com:article/27867759
publishDate 2024
rights_invalid_str_mv CC BY-NC-SA 4.0
status_str publishedVersion
title Penyusunan Korpus Bahasa Daerah