Personaliseerbare lange-context symbolische muziekinvulling met MIDI-RWKV

Samenvatting

Bestaand onderzoek op het gebied van automatische muziekgeneratie heeft zich voornamelijk gericht op end-to-end systemen die complete composities of voortzettingen produceren. Omdat muzikale compositie echter typisch een iteratief proces is, maken dergelijke systemen het moeilijk om de wisselwerking tussen mens en machine te faciliteren, wat essentieel is voor computerondersteunde creativiteit. In deze studie richten we ons op de taak van personaliseerbare, multi-track, lange-context en controleerbare symbolische muzikinvulling om het proces van computerondersteunde compositie te verbeteren. We presenteren MIDI-RWKV, een nieuw model gebaseerd op de RWKV-7 lineaire architectuur, om efficiënte en samenhangende muzikale cocreatie op edge-apparaten mogelijk te maken. We tonen ook aan dat MIDI-RWKV een effectieve methode biedt om de initiële staat af te stemmen voor personalisatie in het zeer-laag-sample regime. We evalueren MIDI-RWKV en de staat-afstemming op verschillende kwantitatieve en kwalitatieve metrieken, en geven modelgewichten en code vrij op https://github.com/christianazinn/MIDI-RWKV.

English

Existing work in automatic music generation has primarily focused on end-to-end systems that produce complete compositions or continuations. However, because musical composition is typically an iterative process, such systems make it difficult to engage in the back-and-forth between human and machine that is essential to computer-assisted creativity. In this study, we address the task of personalizable, multi-track, long-context, and controllable symbolic music infilling to enhance the process of computer-assisted composition. We present MIDI-RWKV, a novel model based on the RWKV-7 linear architecture, to enable efficient and coherent musical cocreation on edge devices. We also demonstrate that MIDI-RWKV admits an effective method of finetuning its initial state for personalization in the very-low-sample regime. We evaluate MIDI-RWKV and its state tuning on several quantitative and qualitative metrics, and release model weights and code at https://github.com/christianazinn/MIDI-RWKV.

Personaliseerbare lange-context symbolische muziekinvulling met MIDI-RWKV

Personalizable Long-Context Symbolic Music Infilling with MIDI-RWKV

Samenvatting

Support