Rachneet
Sachdeva

Ph.D. Student @ UKP Lab, TU Darmstadt

AI Safety

Turning Logic Against Itself:
Probing Model Defenses Through Contrastive Questions

Jailbreak attack to bypass LLM safety mechanisms.

Paper

LLM Hallucinations

Localizing and Mitigating Errors in Long-form Question Answering

Fine-grained error evaluation of long-form LLM responses.

Paper

Emergent Abilities

Are Emergent Abilities in Large Language Models just In-Context Learning?

Evaluation of the hype around LLM abilities.

Paper

Explainable AI

CATfOOD: Counterfactual augmented training for improving out-of-domain performance and calibration

Counterfactual augmentation of data to improve out-of-domain generalization of models.

Paper

Explainable AI

UKP-SQuARE v2: Explainability and Adversarial Attacks for Trustworthy QA

Platform to interpret and explain the predictions of machine learning models.

Paper

Question Answering Platform

UKP-SQuARE: An Online Platform for Question Answering Research

An online QA platform that lets users query, compare, and evaluate different models.

Paper

© 2025
Rachneet Sachdeva