research / personal projects

Research & Projects

Applied research at the intersection of cybersecurity, AI, and social media analysis, with a focus on harmful content detection, political influence modeling, and language technologies for underrepresented languages.

Predict Election Outcomes via Social Media Analysis

End-to-end AI system • NLP • Data Scraping • Visualization

Built an automated scraping pipeline using Puppeteer + TypeScript to collect large-scale Facebook post data.
Developed NLP models in Python to classify political vs non-political content.
Triggered secondary scraping for political posts to extract reactions, shares, comments, and engagement metrics.
Designed a weight scoring algorithm to rank political influence using engagement and sentiment.
Engineered a MongoDB schema (Post, FullPost, PoliticalPost) to manage processing stages.
Created an interactive dashboard using Nuxt.js + Tailwind for trend visualization.
Integrated everything through a Flask REST API for automation.

Source Code ↗ Political Model ↗ Candidate Model ↗

Tools & Techniques to Combat Cyber Radicalization (Sri Lanka)

NLP • Deep Learning • Image Analysis • Social Platforms

Developed a radicalization detection system using NLP, deep learning, and ML.
Collected real-world datasets from blogs, deep web sources, Twitter, Reddit, and YouTube.
Implemented sentiment and behavior analysis for early radicalization signals.
Built an image-based radical content detector using deep learning.
Visualized insights using an interactive Streamlit dashboard.

Research Prototype

Sinhala & Tamil RoBERTa Language Models

Language Models • Low-resource NLP

Pre-trained RoBERTa models for Sinhala and Tamil using Masked Language Modeling (MLM).
Trained on the custom-made dataset to support downstream NLP tasks.
Focused on enabling NLP research for low-resource languages.

Sinhala RoBERTa ↗ Tamil RoBERTa ↗

Asian Faces Classification Model

Computer Vision • Deep Learning • Image Classification

A deep learning–based image classification model focused on facial image analysis within Asian demographics. The project explores feature learning, dataset bias considerations, and ethical aspects of facial classification systems.

Built and trained a convolutional neural network (CNN) for facial image classification.
Applied preprocessing, normalization, and augmentation techniques to improve robustness.
Evaluated model performance using accuracy and validation metrics.
Explored ethical considerations and bias awareness in facial recognition systems.

HuggingFace Model ↗ Try Model ↗