...
首页> 外文期刊>Frontiers in Research Metrics and Analytics >Resolving Citation Links with Neural Networks
【24h】

Resolving Citation Links with Neural Networks

机译:用神经网络解析引文链接

获取原文
           

摘要

This work demonstrates how neural network models (NNs) can be exploited towards resolving citation links in the scientific literature, which involves locating passages in the source paper the author had intended when citing the paper. We look at two kinds of models: triplet and binary. The triplet network model works by ranking potential candidates, using what is generally known as the triplet loss, while the binary model tackles the issue by turning it into a binary decision problem, i.e., by labeling a candidate as true or false, depending on how likely a target it is. Experiments are conducted using three datasets developed by the CL-SciSumm project from a large repository of scientific papers in the Association for Computational Linguistics (ACL) repository. The results find that NNs are extremely susceptible to how the input is represented: they perform better on inputs expressed in binary format than on those encoded using the TFIDF metric or neural embeddings. Furthermore, in response to a difficulty NNs and baselines had in predicting the exact location of a target, we introduce the idea of approximately correct targets (ACTs) where the goal is to find a region which likely contains a true target rather than its exact location. We show that with the ACTs, NNs consistently outperform Ranking SVM and TFIDF on the aforementioned datasets.
机译:这项工作演示了如何利用神经网络模型(NNs)来解决科学文献中的引文链接,这涉及在作者引用该论文时打算在源论文中找到段落。我们看两种模型:三元组和二进制。三元组网络模型通过使用通常所说的三元组损失对潜在的候选者进行排名来工作,而二元模型则通过将其转化为二元决策问题来解决该问题,即,根据如何将候选者标记为真或假。可能是目标。使用由CL-SciSumm项目开发的三个数据集进行实验,该数据集来自计算语言协会(ACL)知识库中的大型科学论文仓库。结果发现,NN极易受到输入表示方式的影响:与以TFIDF度量或神经嵌入编码的输入相比,它们在以二进制格式表示的输入上表现更好。此外,针对神经网络和基线难以预测目标的确切位置的问题,我们引入了近似正确目标(ACT)的概念,目标是找到可能包含真实目标而不是其精确位置的区域。我们表明,使用ACT,NN在上述数据集上的性能始终优于Rank SVM和TFIDF。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号