Hierarchical multi-head attention LSTM for polyphonic symbolic melody generation

<p dir="ltr">Creating symbolic melodies with machine learning is challenging because it requires an understanding of musical structure and the handling of inter-dependencies and long-term dependencies. Learning the relationship between events that occur far apart in time in music pos...

وصف كامل

محفوظ في:
التفاصيل البيبلوغرافية
المؤلف الرئيسي: Ahmet Kasif (17787560) (author)
مؤلفون آخرون: Selcuk Sevgen (21734567) (author), Alper Ozcan (21734570) (author), Cagatay Catal (6897842) (author)
منشور في: 2024
الموضوعات:
الوسوم: إضافة وسم
لا توجد وسوم, كن أول من يضع وسما على هذه التسجيلة!
_version_ 1864513544633974784
author Ahmet Kasif (17787560)
author2 Selcuk Sevgen (21734567)
Alper Ozcan (21734570)
Cagatay Catal (6897842)
author2_role author
author
author
author_facet Ahmet Kasif (17787560)
Selcuk Sevgen (21734567)
Alper Ozcan (21734570)
Cagatay Catal (6897842)
author_role author
dc.creator.none.fl_str_mv Ahmet Kasif (17787560)
Selcuk Sevgen (21734567)
Alper Ozcan (21734570)
Cagatay Catal (6897842)
dc.date.none.fl_str_mv 2024-02-08T03:00:00Z
dc.identifier.none.fl_str_mv 10.1007/s11042-024-18491-7
dc.relation.none.fl_str_mv https://figshare.com/articles/journal_contribution/Hierarchical_multi-head_attention_LSTM_for_polyphonic_symbolic_melody_generation/29590373
dc.rights.none.fl_str_mv CC BY 4.0
info:eu-repo/semantics/openAccess
dc.subject.none.fl_str_mv Creative arts and writing
Music
Engineering
Electronics, sensors and digital hardware
Information and computing sciences
Artificial intelligence
Human-centred computing
Machine learning
Symbolic music
Music generation
Recurrent networks
Multi-head attention
dc.title.none.fl_str_mv Hierarchical multi-head attention LSTM for polyphonic symbolic melody generation
dc.type.none.fl_str_mv Text
Journal contribution
info:eu-repo/semantics/publishedVersion
text
contribution to journal
description <p dir="ltr">Creating symbolic melodies with machine learning is challenging because it requires an understanding of musical structure and the handling of inter-dependencies and long-term dependencies. Learning the relationship between events that occur far apart in time in music poses a considerable challenge for machine learning models. Another notable feature of music is that notes must account for several inter-dependencies, including melodic, harmonic, and rhythmic aspects. Baseline methods, such as RNNs, LSTMs, and GRUs, often struggle to capture these dependencies, resulting in the generation of musically incoherent or repetitive melodies. As such, in this study, a hierarchical multi-head attention LSTM model is proposed for creating polyphonic symbolic melodies. This enables our model to generate more complex and expressive melodies than previous methods, while still being musically coherent. The model allows learning of long-term dependencies at different levels of abstraction, while retaining the ability to form inter-dependencies. The study has been conducted on two major symbolic music datasets, MAESTRO and Classical-Music MIDI, which feature musical content encoded on MIDI. The artistic nature of music poses a challenge to evaluating the generated content and qualitative analysis are often not enough. Thus, human listening tests are conducted to strengthen the evaluation. Qualitative analysis conducted on the generated melodies shows significantly improved loss scores on MSE over baseline methods, and is able to generate melodies that were both musically coherent and expressive. The listening tests conducted using Likert-scale support the qualitative results and provide better statistical scores over baseline methods.</p><h2>Other Information</h2><p dir="ltr">Published in: Multimedia Tools and Applications<br>License: <a href="https://creativecommons.org/licenses/by/4.0" target="_blank">https://creativecommons.org/licenses/by/4.0</a><br>See article on publisher's website: <a href="https://dx.doi.org/10.1007/s11042-024-18491-7" target="_blank">https://dx.doi.org/10.1007/s11042-024-18491-7</a></p>
eu_rights_str_mv openAccess
id Manara2_37659bd68fd8f5ff0bdbefd56d524b71
identifier_str_mv 10.1007/s11042-024-18491-7
network_acronym_str Manara2
network_name_str Manara2
oai_identifier_str oai:figshare.com:article/29590373
publishDate 2024
repository.mail.fl_str_mv
repository.name.fl_str_mv
repository_id_str
rights_invalid_str_mv CC BY 4.0
spelling Hierarchical multi-head attention LSTM for polyphonic symbolic melody generationAhmet Kasif (17787560)Selcuk Sevgen (21734567)Alper Ozcan (21734570)Cagatay Catal (6897842)Creative arts and writingMusicEngineeringElectronics, sensors and digital hardwareInformation and computing sciencesArtificial intelligenceHuman-centred computingMachine learningSymbolic musicMusic generationRecurrent networksMulti-head attention<p dir="ltr">Creating symbolic melodies with machine learning is challenging because it requires an understanding of musical structure and the handling of inter-dependencies and long-term dependencies. Learning the relationship between events that occur far apart in time in music poses a considerable challenge for machine learning models. Another notable feature of music is that notes must account for several inter-dependencies, including melodic, harmonic, and rhythmic aspects. Baseline methods, such as RNNs, LSTMs, and GRUs, often struggle to capture these dependencies, resulting in the generation of musically incoherent or repetitive melodies. As such, in this study, a hierarchical multi-head attention LSTM model is proposed for creating polyphonic symbolic melodies. This enables our model to generate more complex and expressive melodies than previous methods, while still being musically coherent. The model allows learning of long-term dependencies at different levels of abstraction, while retaining the ability to form inter-dependencies. The study has been conducted on two major symbolic music datasets, MAESTRO and Classical-Music MIDI, which feature musical content encoded on MIDI. The artistic nature of music poses a challenge to evaluating the generated content and qualitative analysis are often not enough. Thus, human listening tests are conducted to strengthen the evaluation. Qualitative analysis conducted on the generated melodies shows significantly improved loss scores on MSE over baseline methods, and is able to generate melodies that were both musically coherent and expressive. The listening tests conducted using Likert-scale support the qualitative results and provide better statistical scores over baseline methods.</p><h2>Other Information</h2><p dir="ltr">Published in: Multimedia Tools and Applications<br>License: <a href="https://creativecommons.org/licenses/by/4.0" target="_blank">https://creativecommons.org/licenses/by/4.0</a><br>See article on publisher's website: <a href="https://dx.doi.org/10.1007/s11042-024-18491-7" target="_blank">https://dx.doi.org/10.1007/s11042-024-18491-7</a></p>2024-02-08T03:00:00ZTextJournal contributioninfo:eu-repo/semantics/publishedVersiontextcontribution to journal10.1007/s11042-024-18491-7https://figshare.com/articles/journal_contribution/Hierarchical_multi-head_attention_LSTM_for_polyphonic_symbolic_melody_generation/29590373CC BY 4.0info:eu-repo/semantics/openAccessoai:figshare.com:article/295903732024-02-08T03:00:00Z
spellingShingle Hierarchical multi-head attention LSTM for polyphonic symbolic melody generation
Ahmet Kasif (17787560)
Creative arts and writing
Music
Engineering
Electronics, sensors and digital hardware
Information and computing sciences
Artificial intelligence
Human-centred computing
Machine learning
Symbolic music
Music generation
Recurrent networks
Multi-head attention
status_str publishedVersion
title Hierarchical multi-head attention LSTM for polyphonic symbolic melody generation
title_full Hierarchical multi-head attention LSTM for polyphonic symbolic melody generation
title_fullStr Hierarchical multi-head attention LSTM for polyphonic symbolic melody generation
title_full_unstemmed Hierarchical multi-head attention LSTM for polyphonic symbolic melody generation
title_short Hierarchical multi-head attention LSTM for polyphonic symbolic melody generation
title_sort Hierarchical multi-head attention LSTM for polyphonic symbolic melody generation
topic Creative arts and writing
Music
Engineering
Electronics, sensors and digital hardware
Information and computing sciences
Artificial intelligence
Human-centred computing
Machine learning
Symbolic music
Music generation
Recurrent networks
Multi-head attention