Language Resources and Evaluation

Title | Publication Date | Language | Citations
IEMOCAP: interactive emotional dyadic motion capture database | 2008/11/05 | English | 840
Annotating Expressions of Opinions and Emotions in Language | 2005/05/01 | English | 334
The WaCky wide web: a collection of very large linguistically processed web-crawled corpora | 2009/02/10 | English | 263
A multidimensional approach for detecting irony in Twitter | 2012/07/24 | English | 107
The TORGO database of acoustic and articulatory speech from speakers with dysarthria | 2011/03/26 | English | 93
Resources and benchmark corpora for hate speech detection: a systematic review | 2020/09/30 | English | 83
A large-scale classification of English verbs | 2007/12/12 | English | 78
Balanced corpus of contemporary written Japanese | 2013/12/29 | English | 78
The ACL anthology network corpus | 2013/01/06 | English | 68
Authorship attribution in the wild | 2010/01/13 | English | 67
Lexical association measures and collocation extraction | 2009/10/21 | English | 61
Unleashing the killer corpus: experiences in creating the multi-everything AMI Meeting Corpus | 2007/10/10 | English | 59
Cross-language plagiarism detection | 2010/01/30 | English | 55
Comparative evaluation of text classification techniques using a large diverse Arabic dataset | 2013/03/10 | English | 53
The MUMIN coding scheme for the annotation of feedback, turn management and sequencing phenomena | 2007/12/01 | English | 48
What’s missing in geographical parsing? | 2017/03/07 | English | 47
Lightweight methods to estimate influenza rates and alcohol sales volume from Twitter messages | 2012/05/13 | English | 44
FactBank: a corpus annotated with event factuality | 2009/05/07 | English | 43
Developing a corpus of plagiarised short answers | 2010/01/16 | English | 42
MedSTS: a resource for clinical semantic textual similarity | 2018/10/24 | English | 41
Annotating expressions of Appraisal in English | 2010/12/12 | English | 40
Automatic keyphrase extraction from scientific articles | 2012/12/18 | English | 39
Language resources for Hebrew | 2007/12/13 | English | 38
A massively parallel corpus: the Bible in 100 languages | 2014/11/19 | English | 36
Multilingual and cross-domain temporal tagging | 2012/05/08 | English | 35
SentiTurkNet: a Turkish polarity lexicon for sentiment analysis | 2015/06/20 | English | 34
Intrinsic plagiarism analysis | 2010/01/20 | English | 34
Steven Bird, Ewan Klein and Edward Loper: Natural Language Processing with Python, Analyzing Text with the Natural Language Toolkit | 2010/05/27 | English | 33
ELRA – European Language Resources Association-Background, Recent Developments and Future Perspectives | 2005/02/01 | English | 32
Perspectives on crowdsourcing annotations for natural language processing | 2012/03/16 | English | 31