Mixture-of-Experts Implementation
Implemented a Switch Transformer alongside a conventional autoregressive transformer and trained both on TinyShakespeare to study the effects of the mixture-of-experts architecture on validation loss, sample efficiency, and training time.
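For context, the defining pieces of a Switch Transformer are a top-1 router that sends each token to a single expert feed-forward network and an auxiliary load-balancing loss that discourages the router from collapsing onto one expert. The sketch below is a minimal PyTorch illustration of that layer, not the project's actual code: the class name `SwitchFFN`, the per-expert loop, and all dimensions are assumptions for clarity.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class SwitchFFN(nn.Module):
    """Illustrative Switch-style MoE feed-forward layer with top-1 routing."""

    def __init__(self, d_model: int, d_ff: int, n_experts: int):
        super().__init__()
        self.n_experts = n_experts
        self.router = nn.Linear(d_model, n_experts)
        self.experts = nn.ModuleList(
            nn.Sequential(nn.Linear(d_model, d_ff), nn.GELU(), nn.Linear(d_ff, d_model))
            for _ in range(n_experts)
        )

    def forward(self, x: torch.Tensor) -> tuple[torch.Tensor, torch.Tensor]:
        # x: (batch, seq, d_model); flatten to a stream of tokens for routing.
        tokens = x.reshape(-1, x.size(-1))
        probs = F.softmax(self.router(tokens), dim=-1)   # (n_tokens, n_experts)
        gate, expert_idx = probs.max(dim=-1)             # top-1 expert per token

        out = torch.zeros_like(tokens)
        for i, expert in enumerate(self.experts):
            mask = expert_idx == i
            if mask.any():
                # Scale by the gate probability so the router still
                # receives gradients through the selected path.
                out[mask] = gate[mask].unsqueeze(-1) * expert(tokens[mask])

        # Load-balancing auxiliary loss: fraction of tokens routed to each
        # expert times the mean router probability for that expert.
        freq = F.one_hot(expert_idx, self.n_experts).float().mean(dim=0)
        density = probs.mean(dim=0)
        aux_loss = self.n_experts * (freq * density).sum()

        return out.reshape_as(x), aux_loss
```

In a training loop, `aux_loss` would be added to the language-modeling loss with a small weight (on the order of 1e-2 in the Switch Transformer paper). The Python loop over experts is the simplest dense formulation; real implementations typically batch tokens per expert with a fixed capacity for efficiency.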