- Home
- Vacatures
- Vacatures Veenendaal
- Vacaturedetails
Vacature doormailen
Masters Thesis in Data & AI: Privacy preserving RAG Veenendaal • Info Support
- Vacature rapporteren
Gevraagd
-
37 - 40 uur
Aanbod
-
Loondienst (vast)
-
1.000 p/m (bruto)
-
Auto v/d zaak
Vacature in het kort
Over het bedrijf
Volledige vacaturetekst
Challenging assignment with €1000 compensation or €500 + lease car or €600 + housing, professional guidance, training sessions, knowledge events, brainstorming with colleagues and 2 vacation days p/m.Privacy is a critical challenge in deploying Retrieval-Augmented Generation (RAG) systems in sensitive domains. This thesis investigates how privacy-preserving techniques, such as differential privacy and synthetic data, can be integrated into RAG pipelines without degrading output quality. You will analyze trade-offs, enhance a promising method, and validate your approach with a Proof of Concept focused on real-world utility and privacy guarantees.
ð¡Areas of Interest: Information retrieval, AI, data privacy, NLP, differential privacy
Retrieval-Augmented Generation (RAG) systems enhance large language models (LLMs) by incorporating related external knowledge into prompts. This mitigates hallucinations and improves output quality, especially when the information falls outside the model’s original training data. However, RAG systems currently offer no guarantees that privacy-sensitive content will remain protected in their outputs, posing significant compliance and ethical risks. Consequently, such sources are often excluded from RAG applications, limiting their effectiveness in privacy-critical sectors like healthcare, legal services, finance, and government. To fully leverage RAG's potential in these domains, we need robust, scalable methods to preserve privacy without compromising performance. This thesis addresses the challenge of preserving privacy in RAG systems.
The Assignment
Your research will include two components:
- Literature Study
- Review state-of-the-art methods for privacy-preserving RAG.Focus areas include:
- Differentially Private In-Context Learning (e.g., DP-ICL2)
- Synthetic document generation (e.g., SAGE)
- Private fine-tuning (e.g., DP-SGD, masking techniques)
- Analyze trade-offs between privacy guarantees and model utility.
- Proof of Concept (PoC)
- Select one promising technique and enhance it.
- Ensure your improvement addresses gaps identified in the literature.
- Build and evaluate a PoC integrating your privacy method into a RAG pipeline.
- Evaluation metrics:
- Privacy: Differential Privacy parameters (ε, δ)
- Utility: Accuracy, BLEU/ROUGE scores, latency
Research Question
You will start with the following broad research question, which you can tailor to your most promising approach later on.
"How can privacy be preserved in Retrieval-Augmented Generation systems without sacrificing model utility?"
Materials
- Baseline project:
Paper: RAG with Differential Privacy
Medium article:
- Paper: Privacy-Preserving In-context Learning with Differentially Private Few-shot Generation:
- Paper: Mitigating the Privacy Issues in Retrieval-Augmented Generation (RAG) via Pure Synthetic Data
About Info Support
Info Support specializes in custom software, data/AI solutions, management, and training and is active in the Finance, Industry, Agriculture, Food & Retail, Mobility & Public, and Healthcare sectors. We provide solid and innovative solutions for complex and critical software issues. Our headquarters are located in Veenendaal (NL) and Mechelen (BE). At present, approximately 500 employees are employed by Info Support.
Info Support's working method is characterized by a number of core values: solidity, integrity, craftsmanship, and passion. These core values are intertwined in our work and the way we interact with each other.
To ensure that all employees are always up to date with the latest developments, Info Support has an in-house knowledge center that eagerly satisfies the hunger for more or different knowledge and skills.
B2 language proficiency in Dutch is required.
Gerelateerde zoekopdrachten
Fulltime VeenendaalLoondienst (vast) VeenendaalVeenendaalProvincie UtrechtVanaf nu ontvang je automatisch de best passende vacatures automatisch in je mailbox.
Jouw inschrijving
Emailadres:
Functie:
Plaats:
Frequentie:
Wijzig je inschrijving
Ontvang als eerste nieuwe vacatures in Veenendaal
Vind werknemers in Veenendaal op Werkzoeken.nl