JOSE CAMACHO COLLADOS

Natural Language Processing

Jose Camacho Collados

publications

Jose Camacho Collados

2023

Asahi Ushio, Yi Zhou, Jose Camacho-Collados.
An Efficient Multilingual Language Model Compression through Vocabulary Trimming. [paper]
Findings of EMNLP 2023, Singapore.
 
Asahi Ushio, Jose Camacho-Collados, Steven Schockaert.
RelBERT: Embedding Relations with Language Models. [paper]
arXiv (2023).
 
Dimosthenis Antypas and Jose Camacho-Collados.
Robust Hate Speech Detection in Social Media: A Cross-Dataset Empirical Evaluation. [paper] [model]
ACL 2023 WOAH Workshop, Toronto, Canada.
 
Asahi Ushio, Fernando Alva-Manchego, Jose Camacho-Collados.
An Empirical Comparison of LM-based Question and Answer Generation Methods. [paper]
Findings of ACL 2023, Toronto, Canada.
 
Asahi Ushio, Fernando Alva-Manchego, Jose Camacho-Collados.
A Practical Toolkit for Multilingual Question and Answer Generation. [paper] [demo] [code]
ACL 2023 (Demo), Toronto, Canada.
 
Dimosthenis Antypas, Alun Preece and Jose Camacho-Collados.
Negativity spreads faster: A large-scale multilingual twitter analysis on the role of sentiment in political communication. [paper] [data&code]
Online Social Networks and Media Journal (2023).
 
David Owen, Dimosthenis Antypas, Athanasios Hassoulas, Antonio F Pardiñas, Luis Espinosa-Anke and Jose Camacho Collados.
Enabling Early Health Care Intervention by Detecting Depression in Users of Web-Based Forums using Language Models: Longitudinal Analysis and Evaluation. [paper]
JMIR AI Journal (2023).
 

2022

Kiamehr Rezaee and Jose Camacho-Collados.
Probing Relational Knowledge in Language Models via Word Analogies. [paper]
Findings of EMNLP 2022, Abu Dhabi, UAE.
 
Asahi Ushio, Fernando Alva-Manchego and Jose Camacho-Collados.
Generative Language Models for Paragraph-Level Question Generation. [paper] [code] [demo]
EMNLP 2022, Abu Dhabi, UAE.
 
Jose Camacho-Collados, Kiamehr Rezaee, Talayeh Riahi, Asahi Ushio, Daniel Loureiro, Dimosthenis Antypas, Joanne Boisson, Luis Espinosa-Anke, Fangyu Liu, Eugenio Martínez-Cámara, Gonzalo Medina, Thomas Buhrmann, Leonardo Neves and Francesco Barbieri.
TweetNLP: Cutting-Edge Natural Language Processing for Social Media. [paper] [code] [demo]
EMNLP 2022 (Demo), Abu Dhabi, UAE.
 
Aleksandra Edwards, Asahi Ushio, Jose Camacho-Collados, Hélène de Ribaupierre and Alun Preece.
Guiding Generative Language Models for Data Augmentation in Few-Shot Text Classification. [paper]
EMNLP 2022 DaSH Workshop, Abu Dhabi, UAE.
 
Daniel Loureiro, Aminette D'Souza, Areej Nasser Muhajab, Isabella A. White, Gabriel Wong, Luis Espinosa Anke, Leonardo Neves, Francesco Barbieri and Jose Camacho-Collados.
TempoWiC: An Evaluation Benchmark for Detecting Meaning Shift in Social Media. [paper] [data&code]
COLING 2022 (short), Gyeongju, Republic of Korea.
 
Dimosthenis Antypas*, Asahi Ushio*, Jose Camacho-Collados, Leonardo Neves, Vítor Silva and Francesco Barbieri.
Twitter Topic Classification. [paper] [data]
COLING 2022, Gyeongju, Republic of Korea.
 
Mark Anderson and Jose Camacho-Collados.
Assessing the Limits of the Distributional Hypothesis in Semantic Spaces: Trait-based Relational Knowledge and the Impact of Co-occurrences. [paper] [data]
*SEM 2022, Seattle, USA.
 
Joanne Boisson, Luis Espinosa-Anke and Jose Camacho-Collados.
CardiffNLP-Metaphor at SemEval-2022 Task 2: Targeted Fine-tuning of Transformer-based Language Models for Idiomaticity Detection. [paper] [code]
SemEval 2022, Seattle, USA.
 
Daniel Loureiro, Francesco Barbieri, Leonardo Neves, Luis Espinosa Anke and Jose Camacho-Collados.
TimeLMs: Diachronic Language Models from Twitter. [paper] [data&code]
ACL 2022 (Demo), Dublin, Ireland.
 
Francesco Barbieri, Luis Espinosa Anke and Jose Camacho-Collados.
XLM-T: Multilingual Language Models in Twitter for Sentiment Analysis and Beyond. [paper] [data&code]
LREC 2022, Marseille, France.
 
Daniel Loureiro, Alípio Mário Jorge and Jose Camacho-Collados.
LMMS Reloaded: Transformer-based Sense Embeddings for Disambiguation and Beyond. [paper] [data&code]
Artificial Intelligence Journal (2022).
 

2021

Asahi Ushio, Jose Camacho-Collados and Steven Schockaert.
Distilling Relation Embeddings from Pretrained Language Models [paper] [data&code]
EMNLP 2021
 
Asahi Ushio, Federico Liberatore and Jose Camacho-Collados.
Back to the Basics: A Quantitative Analysis of Statistical and Graph-Based Term Weighting Schemes for Keyword Extraction [paper] [data&code]
EMNLP 2021
 
Asahi Ushio, Luis Espinosa-Anke, Steven Schockaert and Jose Camacho-Collados.
BERT is to NLP what AlexNet is to CV: Can Pre-Trained Language Models Identify Analogies? [paper] [data&code]
ACL 2021
 
Dimosthenis Antypas, David Rogers, Alun Preece and Jose Camacho-Collados.
COVID-19 and Misinformation: A Large-Scale Lexical Analysis on Twitter. [paper]
ACL 2021 Student Research Workshop
 
David Tuxworth, Dimosthenis Antypas, Luis Espinosa-Anke, Jose Camacho-Collados, Alun Preece and David Rogers.
Deriving Disinformation Insights from Geolocalized Twitter Callouts. [paper]
KDD 2021 Workshop On Deriving Insights From User-Generated Text
 
Daniel Loureiro*, Kiamehr Rezaee*, Mohammad Taher Pilehvar and Jose Camacho-Collados.
Language Models and Word Sense Disambiguation: An Overview and Analysis. [paper] [data&code]
Computational Linguistics (2021)
 
Na Li, Zied Bouraoui, Jose Camacho-Collados, Luis Espinosa-Anke, Qing Gu and Steven Schockaert.
Modelling General Properties of Nouns by Selectively Averaging Contextualised Embeddings. [paper]
IJCAI 2021
 
Asahi Ushio and Jose Camacho-Collados.
T-NER: An All-Round Python Library for Transformer-based Named Entity Recognition. [paper] [data&code]
EACL 2021 (demo)
 
Anna Breit, Artem Revenko, Kiamehr Rezaee, Mohammad Taher Pilehvar and Jose Camacho-Collados.
WiC-TSV: An Evaluation Benchmark for Target Sense Verification of Words in Context. [paper] [data&challenge]
EACL 2021
 
Yerai Doval, Jose Camacho-Collados, Luis Espinosa-Anke and Steven Schockaert.
Meemi: Finding the Middle Ground in Cross-lingual Word Embeddings. [paper] [data&code]
Natural Language Engineering (2021).
 
Aleksandra Edwards, David Rogers, Jose Camacho-Collados, Hélène de Ribaupierre and Alun Preece.
Predicting Themes within Complex Unstructured Texts: A Case Study on Safeguarding Reports. [paper]
ESWC 2021 DeepOntoNLP Workshop
 

2020

David Owen, Jose Camacho-Collados and Luis Espinosa-Anke.
Towards Preemptive Detection of Depression and Anxiety in Twitter. [paper] [data]
COLING 2020 Workshop on Social Media Mining for Health Applications.
 
Mireia Roig Mirapeix, Luis Espinosa Anke and Jose Camacho-Collados.
Definition Extraction Feature Analysis: From Canonical to Naturally-Occurring Definitions. [paper]
COLING 2020 Workshop on Cognitive Aspects of the Lexicon.
 
Aleksandra Edwards, Jose Camacho-Collados, Hélène De Ribaupierre, Alun Preece.
Go Simple and Pre-Train on Domain-Specific Corpora: On the Role of Training Data for Text Classification. [paper] [data]
COLING 2020 (short)
 
Hsiao-Yu Chiang, Jose Camacho-Collados and Zachary Pardos.
Understanding the Source of Semantic Regularities in Word Embeddings. [paper]
CoNLL 2020.
 
Francesco Barbieri, Jose Camacho-Collados, Leonardo Neves and Luis Espinosa-Anke.
TweetEval: Unified Benchmark and Comparative Evaluation for Tweet Classification. [paper] [data]
Findings of EMNLP 2020.
 
Alessandro Raganato*, Tommaso Pasini*, Jose Camacho-Collados and Mohammad Taher Pilehvar.
XL-WiC: A Multilingual Benchmark for Evaluating Semantic Contextualization. [paper] [data] [competition]
EMNLP 2020.
 
Daniel Loureiro and Jose Camacho-Collados.
Don't Neglect the Obvious: On the Role of Unambiguous Words in Word Sense Disambiguation. [paper] [data&code]
EMNLP 2020 (short)
 
Tomoki Ito, Jose Camacho-Collados, Hiroki Sakaji and Steven Schockaert.
Learning Company Embeddings from Annual Reports for Fine-grained Industry Characterization. [paper] [data&code]
IJCAI 2020 Workshop on Financial Technology and Natural Language Processing, Yokohama, Japan.
 
Jae Hee Lee, Jose Camacho-Collados, Luis Espinosa-Anke and Steven Schockaert.
Capturing Word Order in Averaging Based Sentence Embeddings. [paper] [code]
ECAI 2020, Santiago de Compostela, Spain.
 
Zied Bouraoui, Jose Camacho-Collados and Steven Schockaert.
Inducing Relational Knowledge from BERT. [paper]
AAAI 2020, New York, USA.
 
Zied Bouraoui, Jose Camacho-Collados, Luis Espinosa-Anke and Steven Schockaert.
Modelling Semantic Categories using Conceptual Neighborhood. [paper]
AAAI 2020, New York, USA.
 
Jose Camacho-Collados*, Yerai Doval*, Eugenio Martínez-Cámara, Luis Espinosa-Anke, Francesco Barbieri and Steven Schockaert.
Learning Cross-lingual Embeddings from Twitter via Distant Supervision. [paper] [data]
ICWSM 2020, Atlanta, USA.
 
Tommaso Pasini and Jose Camacho-Collados.
A Short Survey on Sense-Annotated Corpora for Diverse Languages and Resources. [paper]
LREC 2020, Marseille, France.
 
Yerai Doval*, Jose Camacho-Collados*, Luis Espinosa-Anke and Steven Schockaert.
On the robustness of unsupervised and semi-supervised cross-lingual word embedding learning. [paper]
LREC 2020, Marseille, France.
 

2019

Jose Camacho-Collados, Luis Espinosa-Anke and Steven Schockaert.
Relational Word Embeddings. [paper] [data&code]
ACL 2019, Florence, Italy.
 
Jose Camacho-Collados, Luis Espinosa-Anke, Shoaib Jameel and Steven Schockaert.
A Latent Variable Model for Learning Distributional Relation Vectors. [paper] [data&code]
IJCAI 2019, Macau, China.
 
Carlos Perelló, David Tomás, Alberto Garcia-Garcia, Jose Garcia-Rodriguez and Jose Camacho-Collados.
UA at SemEval-2019 Task 5: Setting A Strong Linear Baseline for Hate Speech Detection. [paper]
SemEval 2019, Minneapolis, USA.
 
Mohammad Taher Pilehvar and Jose Camacho-Collados.
WiC: the Word-in-Context Dataset for Evaluating Context-Sensitive Meaning Representations. [paper] [data] [competition]
NAACL 2019 (short), Minneapolis, USA.
 
Roberta Sinoara, Jose Camacho-Collados, Rafael G. Rossi, Roberto Navigli and Solange O. Rezende.
Knowledge-enhanced document embeddings for text classification. [paper] [data]
Knowledge-Based Systems (2019).
 
Jose Camacho-Collados, Claudio Delli Bovi, Alessandro Raganato and Roberto Navigli.
SenseDefs: a multilingual corpus of semantically annotated textual definitions. [paper] [data]
Language Resources and Evaluation (2019).
 

2018

Jose Camacho-Collados and Mohammad Taher Pilehvar.
From Word to Sense Embeddings: A Survey on Vector Representations of Meaning. [paper]
Journal of Artificial Intelligence Research (2018).
 
Yerai Doval, Jose Camacho-Collados, Luis Espinosa-Anke and Steven Schockaert.
Improving Cross-Lingual Word Embeddings by Meeting in the Middle. [paper] [data&code]
EMNLP 2018, Brussels, Belgium.
 
Francesco Barbieri, Luis Espinosa-Anke, Jose Camacho-Collados, Steven Schockaert and Horacio Saggion.
Interpretable Emoji Prediction via Label-Wise Attention LSTMs. [paper] [data&code]
EMNLP 2018 (short), Brussels, Belgium.
 
Jose Camacho-Collados and Mohammad Taher Pilehvar.
On the Role of Text Preprocessing in Neural Network Architectures: An Evaluation Study on Text Categorization and Sentiment Analysis. [paper] [data&code]
EMNLP 2018 Workshop on Analyzing and interpreting neural networks for NLP, Brussels, Belgium.
 
Jose Camacho-Collados, Claudio Delli Bovi, Luis Espinosa-Anke, Sergio Oramas, Tommaso Pasini, Enrico Santus, Vered Shwartz, Roberto Navigli and Horacio Saggion.
SemEval-2018 Task 9: Hypernym Discovery. [paper] [website]
SemEval 2018, New Orleans, Louisiana, United States.
 
Francesco Barbieri, Jose Camacho-Collados, Francesco Ronzano, Luis Espinosa-Anke, Miguel Ballesteros, Valerio Basile, Viviana Patti and Horacio Saggion.
SemEval-2018 Task 2: Multilingual Emoji Prediction. [paper] [website]
SemEval 2018, New Orleans, Louisiana, United States.
 
Francesco Barbieri and Jose Camacho-Collados.
How Gender and Skin Tone Modifiers Affect Emoji Semantics in Twitter. [paper] [data&code]
*SEM 2018, New Orleans, Louisiana, United States.
 
Lara Quijano-Sánchez, Federico Liberatore, Jose Camacho-Collados and Miguel Camacho-Collados.
Applying automatic text-based detection of deceptive language to police reports: extracting behavioral patterns from a multi-step classification model to understand how we lie to the Police. [paper] [bib]
Knowledge-Based Systems (2018).
Research award (1st) from the Spanish police foundation  
Jose Camacho-Collados.
Semantic Vector Representations of Word Senses, Concepts and Entities and their Applications in Natural Language Processing. [thesis]
PhD Thesis (2018).
PhD thesis  

2017

Massimiliano Mancini*, Jose Camacho-Collados*, Ignacio Iacobacci and Roberto Navigli.
Embedding Words and Senses Together via Joint Knowledge-Enhanced Training. [paper] [bib] [data&code]
CoNLL 2017, Vancouver, Canada.
 
Mohammad Taher Pilehvar, Jose Camacho-Collados, Roberto Navigli and Nigel Collier.
Towards a Seamless Integration of Word Senses into Downstream NLP Applications. [paper] [bib] [data&code]
ACL 2017, Vancouver, Canada.
 
Claudio Delli Bovi, Jose Camacho-Collados, Alessandro Raganato and Roberto Navigli.
EuroSense: Automatic Harvesting of Multilingual Sense Annotations from Parallel Text. [paper] [bib] [data]
ACL 2017 (short), Vancouver, Canada.
 
Jose Camacho-Collados*, Mohammad Taher Pilehvar*, Nigel Collier and Roberto Navigli.
SemEval-2017 Task 2: Multilingual and Cross-lingual Semantic Word Similarity. [paper] [bib] [website]
SemEval 2017, Vancouver, Canada.
 
Jose Camacho-Collados.
Why we have switched from building full-fledged taxonomies to simply detecting hypernymy relations. [paper]
arXiv preprint arXiv:1703.04178 (2017).
 
Jose Camacho-Collados and Roberto Navigli.
BabelDomains: Large-Scale Domain Labeling of Lexical Resources. [paper] [data]
EACL 2017 (short), Valencia, Spain.
 
Alessandro Raganato, Jose Camacho-Collados and Roberto Navigli.
Word Sense Disambiguation: A Unified Evaluation Framework and Empirical Comparison. [paper] [website] [slides]
EACL 2017, Valencia, Spain.
 

2016

Luis Espinosa-Anke, Jose Camacho-Collados, Sara Rodríguez-Fernández, Horacio Saggion and Leo Wanner.
Extending WordNet with Fine-Grained Collocational Information via Supervised Distributional Learning. [paper] [bib] [data&code]
COLING 2016, Osaka, Japan, pp. 3422-3432.
 
Alessandro Raganato, Jose Camacho-Collados, Antonio Raganato and Yunseo Joung.
Semantic Indexing of Multilingual Corpora and its Application on the History Domain. [paper] [bib] [data&interface]
COLING 2016 Workshop on Language Technology Resources and Tools for Digital Humanities, Osaka, Japan, pp. 140-147.
 
Jose Camacho-Collados, Mohammad Taher Pilehvar and Roberto Navigli.
Nasari: Integrating explicit knowledge and corpus statistics for a multilingual representation of concepts and entities. [paper] [bib] [website]
Artificial Intelligence Journal (2016), 240, pp. 36-64.
Best PhD paper award 2016 in the Computer Science Department of Sapienza University  
Luis Espinosa-Anke, Jose Camacho-Collados, Claudio Delli Bovi and Horacio Saggion.
Supervised Distributional Hypernym Discovery via Domain Adaptation. [paper] [bib] [data&code] [poster]
EMNLP 2016, Austin, USA, pp. 424-435.
 
Luis Espinosa-Anke, Sergio Oramas, Jose Camacho-Collados, and Horacio Saggion.
Finding and Expanding Hypernymic Relations in the Music Domain. [paper]
CCIA 2016, Barcelona, Spain, pp. 291-296.
Best Poster award in CCIA 2016  
Jose Camacho-Collados and Roberto Navigli.
Find the word that does not belong: A Framework for an Intrinsic Evaluation of Word Vector Representations. [paper] [bib] [data&code]
RepEval, ACL 2016, Berlin, Germany, pp. 43-50.
 
Jose Camacho-Collados, Claudio Delli Bovi, Alessandro Raganato and Roberto Navigli.
A Large-Scale Multilingual Disambiguation of Glosses. [paper] [bib] [data] [slides]
LREC 2016, Portoroz, Slovenia, pp. 1701-1708.
 

2015

Jose Camacho-Collados, Mohammad Taher Pilehvar and Roberto Navigli.
A Unified Multilingual Semantic Representation of Concepts. [paper] [bib] [website]
ACL 2015, Beijing, China, pp. 741-751.
 
Jose Camacho-Collados, Mohammad Taher Pilehvar and Roberto Navigli.
A Framework for the Construction of Monolingual and Cross-lingual Word Similarity Datasets. [paper] [bib] [data] [video]
ACL 2015 (short), Beijing, China, pp. 1-7.
 
Jose Camacho-Collados, Mohammad Taher Pilehvar and Roberto Navigli.
NASARI: a Novel Approach to a Semantically-Aware Representation of Items. [paper] [bib] [website]
NAACL 2015, Denver, USA, pp. 567-577.
 

2014

Mokhtar Billami, Jose Camacho-Collados, Evelyne Jacquey, and Laurence Kister.
Annotation sémantique et validation terminologique en texte intégral en SHS. [paper]
TALN 2014, Marseille, France.
 
Jose Camacho-Collados, Mokhtar Billami, Evelyne Jacquey, and Laurence Kister.
Approche statistique pour le filtrage terminologique des occurrences de candidats termes en texte intégral. [paper]
JADT 2014, Paris, France.
 

2013

Jose Camacho-Collados.
Syntactic Simplification for Machine Translation. [thesis]
Master thesis (2013).
PhD thesis  
Jose Camacho-Collados
Syntactic Simplification for Machine Translation. [paper]
BULAG 2013
 
Jose Camacho-Collados
Splitting complex sentences for Natural Language Processing applications: Building a Simplified Spanish Corpus [paper]
CILC 2013, Alicante, Spain.