Oscar Sainz

Postdoctoral Researcher @ University of the Basque Country
Member of Ixa group & HiTZ center

Email: oscar.sainz@ehu.eus

About me

Greetings, I'm Oscar!

I am a postdoctoral researcher at the University of the Basque Country in the HiTZ center and IXA group. My research interests are focused on low-resource scenarios and particularly on low-resource Information Extraction. I obtained my PhD at the University of the Basque Country in 2024. I worked on zero and few-shot Information Extraction using entailment models and making LLMs learn to follow annotation guidelines.

Additional research interests include cross-lingual evaluations, LLMs for low-resource languages (particularly Basque), and data contamination in LLMs.

  • Publications
  • Students
  • Other activities
  • Grants & Awards
  • Career
  • Slides
  • Posters
  • Models

Recent publications

Showing last 10 publications. For more visit my Google Scholar or Semantic Scholar page.

Students

  • Mikel Zubillaga (2022-2023): BSc student at the University of the Basque Country, co-advised with Oier Lopez de Lacalle.

Other activities

Grants and Awards

  • Best Reviewer Award at the EMNLP 2023 in the Information Extraction track
  • Predoc-berri PhD grant by the Basque Government
  • IKASIKER collaboration grant by the Basque Government

Career

 
 
 
 
 
2024-present
Postdoctoral Researcher

University of the Basque Country (UPV/EHU)
Hitz Center for Language Technologies - Ixa group

 
 
 
 
 
2020-2024
Ph.D. in Natural Language Processing

University of the Basque Country (UPV/EHU)
Hitz Center for Language Technologies - Ixa group

 
 
 
 
 
2019-2020
M.S. in Language Analyzing and Processing

University of the Basque Country (UPV/EHU)

Grade: 9.26 / 10

 
 
 
 
 
2018-2020
Research Internship

Ixa research group
IKASIKER collaboration grant

 
 
 
 
 
2015-2019
B.S. in Computer Science

University of the Basque Country (UPV/EHU)

Grade: 8.06 / 10

Code

hitz-zentroa/GoLLIE

Guideline following Large Language Model for Information Extraction

osainz59/Ask2Transformers

A Framework for Textual Entailment based Zero Shot text classification

hitz-zentroa/lm-contamination

The LM Contamination Index is a manually created database of contamination evidences for LMs.

osainz59/t5-encoder

A extension of Transformers library to include T5ForSequenceClassification class.