Textbasiertes neuronales kollaboratives Filtermodell für die Verfolgung von Papierquellen

papers.abstract

Die Identifizierung signifikanter Referenzen innerhalb der komplexen Wechselbeziehungen eines Zitationswissensgraphen ist herausfordernd, da sie Verbindungen über Zitationen, Autorschaft, Stichwörter und andere relationale Attribute umfasst. Die Aufgabe der Papierquellenverfolgung (PST) zielt darauf ab, die Identifizierung entscheidender Referenzen für gegebene wissenschaftliche Artikel mithilfe fortschrittlicher Datenanalysetechniken zu automatisieren. Im KDD CUP 2024 entwerfen wir ein auf Empfehlungen basierendes Framework, das speziell für die PST-Aufgabe entwickelt wurde. Dieses Framework verwendet das Neural Collaborative Filtering (NCF)-Modell zur Generierung endgültiger Vorhersagen. Um die textuellen Attribute der Artikel zu verarbeiten und Eingabemerkmale für das Modell zu extrahieren, nutzen wir SciBERT, ein vortrainiertes Sprachmodell. Gemäß den experimentellen Ergebnissen erzielte unsere Methode einen Wert von 0.37814 auf der Mean Average Precision (MAP)-Metrik, übertraf Basismodelle und belegte den 11. Platz unter allen teilnehmenden Teams. Der Quellcode ist öffentlich unter https://github.com/MyLove-XAB/KDDCupFinal verfügbar.

English

Identifying significant references within the complex interrelations of a citation knowledge graph is challenging, which encompasses connections through citations, authorship, keywords, and other relational attributes. The Paper Source Tracing (PST) task seeks to automate the identification of pivotal references for given scholarly articles utilizing advanced data mining techniques. In the KDD CUP 2024, we design a recommendation-based framework tailored for the PST task. This framework employs the Neural Collaborative Filtering (NCF) model to generate final predictions. To process the textual attributes of the papers and extract input features for the model, we utilize SciBERT, a pre-trained language model. According to the experimental results, our method achieved a score of 0.37814 on the Mean Average Precision (MAP) metric, outperforming baseline models and ranking 11th among all participating teams. The source code is publicly available at https://github.com/MyLove-XAB/KDDCupFinal.

Textbasiertes neuronales kollaboratives Filtermodell für die Verfolgung von Papierquellen

Text-Driven Neural Collaborative Filtering Model for Paper Source Tracing

papers.abstract

Support