궤적을 따라 사고하기: 비디오 생성 기술을 활용하여 셀룰러 신호에서 GPS 궤적 재구성하기

초록

모바일 기기는 셀룰러 기지국과 지속적으로 상호작용하며, 인간 이동성을 이해하기 위한 광범위한 커버리지를 제공하는 방대한 양의 신호 기록을 생성합니다. 그러나 이러한 기록은 단순한 위치 단서(예: 서빙 셀 식별자)만 제공하므로, 고정밀 GPS 궤적이 필요한 애플리케이션에 직접 사용하는 데는 한계가 있습니다. 본 논문은 셀룰러 신호로부터 GPS 궤적을 재구성하는 Sig2GPS 문제를 연구합니다. 기존의 복잡한 다단계 엔지니어링 파이프라인에 의존하거나 좌표를 회귀하는 방식과 달리, 도메인 전문가들이 종종 지도 위에 신호 흔적을 표시하고 해당 GPS 경로를 스케치하는 방식에서 영감을 받아, Sig2GPS는 지도-시각 영역에서 직접 작동하는 이미지-비디오 생성 작업으로 재구성되었습니다: 신호 흔적을 지도에 렌더링하고, 비디오 생성 모델을 학습시켜 연속적인 GPS 경로를 그리도록 합니다. 이러한 패러다임을 지원하기 위해, 오픈소스 비디오 모델을 미세 조정할 수 있도록 쌍을 이루는 신호-궤적 비디오 데이터셋을 구축하고, 보상을 통해 생성 정확도를 향상시키는 궤적 인식 강화 학습 기반 최적화 방법을 도입했습니다. 대규모 실제 데이터셋에 대한 실험 결과, 강력한 엔지니어링 및 학습 기반 베이스라인 대비 상당한 향상을 보였으며, 다음 GPS 위치 예측에 대한 추가 결과는 확장성과 도시 간 전이 가능성을 시사합니다. 전반적으로 이러한 결과는 지도-시각 비디오 생성이 지도 제약 조건 하에서 연속 경로를 직접 생성하고 정제할 수 있도록 함으로써 궤적 데이터 마이닝을 위한 실용적인 인터페이스를 제공함을 보여줍니다.

English

Mobile devices continuously interact with cellular base stations, generating massive volumes of signaling records that provide broad coverage for understanding human mobility. However, such records offer only coarse location cues (e.g., serving-cell identifiers) and therefore limit their direct use in applications that require high-precision GPS trajectories. This paper studies the Sig2GPS problem: reconstructing GPS trajectories from cellular signaling. Inspired by domain experts often lay the signaling trace on the map and sketch the corresponding GPS route, unlike conventional solutions that rely on complex multi-stage engineering pipelines or regress coordinates, Sig2GPS is reframed as an image-to-video generation task that directly operates in the map-visual domain: signaling traces are rendered on a map, and a video generation model is trained to draw a continuous GPS path. To support this paradigm, a paired signaling-to-trajectory video dataset is constructed to fine-tune an open-source video model, and a trajectory-aware reinforcement learning-based optimization method is introduced to improve generation fidelity via rewards. Experiments on large-scale real-world datasets show substantial improvements over strong engineered and learning-based baselines, while additional results on next GPS prediction indicate scalability and cross-city transferability. Overall, these results suggest that map-visual video generation provides a practical interface for trajectory data mining by enabling direct generation and refinement of continuous paths under map constraints.

궤적을 따라 사고하기: 비디오 생성 기술을 활용하여 셀룰러 신호에서 GPS 궤적 재구성하기

Think over Trajectories: Leveraging Video Generation to Reconstruct GPS Trajectories from Cellular Signaling

초록

Support