게이트 QKAN-FWP: 확장 가능한 양자 영감 기반 시퀀스 학습

초록

고속 가중치 프로그래머(FWP)는 순환 은닉 상태 대신 동적으로 업데이트되는 매개변수를 통해 시간적 의존성을 부호화한다. 양자 FWP(QFWP)는 변분 양자 회로(VQC)를 통해 이 아이디어를 확장하지만, 기존 구현은 노이즈가 있는 중간 규모 양자(NISQ) 장치에서 확장이 어렵고 고전적으로 시뮬레이션하기에도 비용이 많이 드는 다중 큐비트 아키텍처에 의존한다. 본 논문에서는 데이터 재업로드 활성화(DARUAN)라고 알려진 학습 가능한 비선형 활성화 함수로서 단일 큐비트 데이터 재업로드 회로를 사용하여 FWP와 양자 영감 콜모고로프-아놀드 네트워크(QKAN)를 통합하는 고속 가중치 프레임워크인 게이트형 QKAN-FWP를 제안한다. 또한 적응형 메모리 커널, 기하학적 유계성, 병렬화 가능한 그래디언트 경로에 대한 이론적 분석을 바탕으로 매개변수 진화를 안정화하는 스칼라 게이트 고속 가중치 업데이트 규칙을 도입한다. 본 프레임워크를 시계열 벤치마크, MiniGrid 강화 학습을 통해 평가하고, 주요 실용적 결과로서 실제 태양 주기 예측을 강조한다. 528개월 입력 윈도우와 132개월 예측 지평을 갖는 장기 예측 설정에서, 본 12.5k 매개변수 모델은 장단기 메모리(LSTM) 네트워크(25.9k-89.1k 매개변수), WaveNet-LSTM(167k), 기본 순환 신경망(11.5k), 수정된 에코 상태 네트워크(132k)를 포함하여 최대 13배 더 많은 매개변수를 가진 일련의 고전적 순환 기준 모델보다 더 낮은 스케일된 평균 제곱 오차(MSE), 최대 진폭 오차, 최대 시점 오차를 달성한다. NISQ 호환성을 검증하기 위해, 학습된 고속 프로그래머를 IonQ 및 IBM 양자 프로세서에 배포하여 1024샷에서 잡음 없는 시뮬레이터 대비 상대 MSE 0.1% 이내의 예측 정확도를 회복한다. 이러한 결과는 게이트형 QKAN-FWP를 확장 가능하고, 매개변수 효율적이며, NISQ 호환 가능한 양자 영감 시퀀스 모델링 접근법으로 자리매김하게 한다.

English

Fast Weight Programmers (FWPs) encode temporal dependencies through dynamically updated parameters rather than recurrent hidden states. Quantum FWPs (QFWPs) extend this idea with variational quantum circuits (VQCs), but existing implementations rely on multi-qubit architectures that are difficult to scale on noisy intermediate-scale quantum (NISQ) devices and expensive to simulate classically. We propose gated QKAN-FWP, a fast-weight framework that integrates FWP with Quantum-inspired Kolmogorov-Arnold Network (QKAN) using single-qubit data re-uploading circuits as learnable nonlinear activation, known as DatA Re-Uploading ActivatioN (DARUAN). We further introduce a scalar-gated fast-weight update rule that stabilizes parameter evolution, supported by a theoretical analysis of its adaptive memory kernel, geometric boundedness, and parallelizable gradient paths. We evaluate the framework across time-series benchmarks, MiniGrid reinforcement learning, and highlight real-world solar cycle forecasting as our main practical result. In the long-horizon setting with 528-month input window and 132-month forecast horizon, our 12.5k-parameter model achieves lower scaled Mean Square Error (MSE), peak amplitude error, and peak timing error than a suite of classical recurrent baselines with up to 13x more parameters, including Long Short-Term Memory (LSTM) networks (25.9k-89.1k parameters), WaveNet-LSTM (167k), Vanilla recurrent neural network (11.5k), and a Modified Echo State Network (132k). To validate NISQ compatibility, we further deploy the trained fast programmer on IonQ and IBM Quantum processors, recovering forecasting accuracy within 0.1% relative MSE of the noiseless simulator at 1024 shots. These results position gated QKAN-FWP as a scalable, parameter-efficient, and NISQ-compatible approach to quantum-inspired sequence modeling.