Kaede
Passionate about Machine Learning and
AI applications, building real-world projects.
Projects
Predicting Part of Speech and Next Word in Scottish Gaelic
Overview:
Developed two neural network models for Scottish Gaelic, a low-resource language, in collaboration with Edinburgh Napier University.
Part-of-Speech Tagger
- Built a BiLSTM-based neural network to tag 87K tokens from the ARCOSG dataset
- Achieved 89% accuracy across 223 unique PoS tags
- Visualized architecture with input, hidden, and output layers
Next-Word Prediction (Small Language Model)
- Preprocessed and trained on 1M tokens using an LSTM model from the Institutional Books Corpus
- Predicted next words in Gaelic phrases
Tools:
PyTorch, FastText, spaCy, Jupyter Notebook, Python
Future Plans:
- Scale training data from 1M to 40M tokens for improved predictions
- Integrate the PoS model into spaCy for open-source usage
CompassQA – AI-powered Student Policy Assistant
Built a RAG system that enables students to ask questions about Cornell College’s student policies, including academic, residential, and financial rules.
Tools:
LangChain, Ollama, Streamlit
Skills
Programming Languages:
Python, R, Swift, SQL (MySQL), HTML/CSS
ML & NLP:
PyTorch, Keras, scikit-learn, spaCy, LangChain, Ollama, TensorFlow, Pandas, NumPy, matplotlib
Developer Tools:
Git, Xcode, RStudio, VS Code, Tableau
Operating Systems:
macOS, Windows
Certifications
Machine Learning with Python
Introduction to Deep Learning & Neural Networks with Keras
Generative AI and LLMs: Architecture and Data Preparation
Gen AI Foundational Models for NLP & Language Understanding
Programming Fundamentals in Swift by Meta
Introduction to iOS Mobile Application Development by Meta