The NLP4NLP Corpus (II): 50 Years of Research in Speech and Language Processing

Mariani Joseph; Francopoulo Gil; Paroubek Patrick; Vernier Frédéric

首页> 外文期刊>Frontiers in Research Metrics and Analytics >The NLP4NLP Corpus (II): 50 Years of Research in Speech and Language Processing

【24h】

The NLP4NLP Corpus (II): 50 Years of Research in Speech and Language Processing

机译：NLP4NLP语料库（II）：50年来语音和语言处理研究

获取原文

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

The NLP4NLP corpus contains articles published in 34 major conferences and journals in the field of speech and natural language processing over a period of 50 years (1965-2015), comprising 65,000 documents, gathering 50,000 authors, including 325,000 references and representing approximately 270 million words. This paper presents an analysis of this corpus regarding the evolution of the research topics, with the identification of the authors who introduced them and of the publication where they were first presented, and the detection of epistemological ruptures. Linking the metadata, the paper content and the references allowed us to propose a measure of innovation for the research topics, the authors and the publications. In addition, it allowed us to study the use of language resources, in the framework of the paradigm shift between knowledge-based approaches and content-based approaches, and the reuse of articles and plagiarism between sources over time. Numerous manual corrections were necessary, which demonstrated the importance of establishing standards for uniquely identifying authors, articles, resources or publications.

机译：NLP4NLP语料库包含在过去50年（1965-2015年）内在语音和自然语言处理领域的34个主要会议和期刊上发表的文章，包括65,000个文档，收集50,000位作者，包括325,000个参考文献，代表大约2.7亿个单词。本文介绍了该语料库在研究主题演变方面的分析，并确定了介绍这些主题的作者和首次提出该文献的出版物，以及认识论破裂的发现。通过链接元数据，论文内容和参考文献，我们可以针对研究主题，作者和出版物提出创新措施。此外，它使我们能够在基于知识的方法与基于内容的方法之间的范式转换的框架内研究语言资源的使用，以及随着时间的推移在源之间重复使用文章和窃。必须进行大量的手动更正，这表明建立标准以唯一标识作者，文章，资源或出版物的重要性。

著录项

来源
《Frontiers in Research Metrics and Analytics》 |2018年第3期|共页
作者
Mariani Joseph; Francopoulo Gil; Paroubek Patrick; Vernier Frédéric;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种
中图分类信息与知识传播;
关键词
Speech ProcessingNatural Language Processingtext analyticsBibliometricsscientometricsInformetrics.;

机译：语音处理自然语言处理文本分析书目计量学科学计量学信息计量学。;

相似文献

外文文献
中文文献
专利

1. The NLP4NLP Corpus (I): 50 Years of Publication, Collaboration and Citation in Speech and Language Processing. [J] . Mariani Joseph, Francopoulo Gil, Paroubek Patrick Frontiers in Research Metrics and Analytics . 2018,第3期

机译：NLP4NLP语料库（I）：50年的语音和语言处理出版，协作和引用。
2. Pietro Torasso, the early years: Speech and language processing [J] . Barbara Di Eugenio, Vincenzo Lombardo Intelligenza Artificiale . 2018,第1期

机译：Pietro Torasso，早期：言语和语言处理
3. The role of early language experience in the development of speech perception and phonological processing abilities: evidence from 5-year-olds with histories of otitis media with effusion and low socioeconomic status. [J] . Nittrouer S, Burton LT Journal of communication disorders . 2005,第1期

机译：早期语言经验在语音感知和语音处理能力发展中的作用：来自5岁，患有中耳炎病史，渗出液且社会经济地位低下的证据。
4. Rediscovering 50 years of discoveries in speech and language processing: A survey [C] . Joseph Mariani, Gil Francopoulo, Patrick Paroubek, 2017 Conference of The Oriental Chater of International Committee for Coordination and Standardization of Speech Databases and Assessment Techniques . 2017

机译：重新调查语音和语言处理方面的50年发现：一项调查
5. A comparative study of three tests that measure receptive language processing ability in subjects ages eleven to twelve years, five months: The token test for children (Disimoni, 1978) the Fullerton language test for adolescents, oral commands subtest (Thorum, 1980) the clinical evaluation of language function, processing linguistic concepts and processing oral directions subtests (Semel & Wiig, 1980). [D] . Butchers, Helen. 1982

机译：对三种测量年龄在11至12岁，五个月的受试者中的接受语言处理能力的测试的比较研究：儿童的令牌测试（Disimoni，1978）青少年的Fullerton语言测试，口头命令子测试（Thorum，1980）临床语言功能评估，处理语言概念和处理口头指示子测验（Semel＆Wiig，1980）。
6. A randomised controlled trial of nonlinear frequency compression versus conventional processing in hearing aids: speech and language of children at 3 years of age [O] . Teresa YC Ching, Julia Day, Vicky Zhang, -1

机译：非线性频率压缩与助听器常规处理的随机对照试验：3岁儿童的语音和语言
7. The NLP4NLP Corpus (I): 50 Years of Publication, Collaboration and Citation in Speech and Language Processing [O] . Joseph Mariani, Gil Francopoulo, Patrick Paroubek 2019

机译：NLP4NLP语料库（i）：50年出版，协作和语言处理引用

The NLP4NLP Corpus (II): 50 Years of Research in Speech and Language Processing

摘要

著录项

相似文献

相关主题

期刊订阅