Projects
Research Projects & Portfolio
Here are some of my key research projects and applications, demonstrating expertise in applied economics, machine learning, and data science.
Featured Research
Sentiment Volume Forecasting
Machine Learning for Financial Markets
Research project investigating the usefulness of news sentiment in predicting intraday stock trading volume for U.S. airline stocks. This comprehensive study demonstrates:
- Scale: Analysis of 1.3 million news articles from GDELT database
- Real-time Processing: 15-minute increment predictions during trading hours
- Advanced ML: TensorFlow, PyTorch, and ensemble methods
- Performance: Results comparable with state-of-the-art literature
- Period: January 2018 to May 2025 data coverage
Technologies: Python, TensorFlow, PyTorch, pandas, GDELT API, real-time data processing
Key Finding: While sentiment features provide marginal improvement in forecasting accuracy, the framework demonstrates the potential for real-time business monitoring applications.
Super Search
Semantic Document Indexing Tool
Advanced semantic indexing application for PDF repositories using state-of-the-art natural language processing. Features include:
- Semantic Search: Beyond keyword matching using transformer models
- Large-scale Processing: Efficient handling of document collections
- NLP Pipeline: Advanced text processing and embeddings
- User Interface: Simple GUI for search and retrieval
Technologies: Python, sentence-transformers, Hugging Face, NLP, document processing
Application: Enables intelligent document search for research and business applications.
Labor Economics Research
Applied Econometric Analysis
Comprehensive econometric analysis examining labor market dynamics and policy impacts using modern microeconometric methods.
Focus Areas:
- Employment and wage determination
- Policy evaluation using difference-in-differences
- Causal inference in labor markets
- Statistical modeling and hypothesis testing
Technologies: R, econometric modeling, statistical analysis, data visualization
Development Economics
Economic Development Analysis
Research on economic development topics using quantitative methods and empirical analysis.
Approach:
- Jupyter notebook-based analysis
- Modern econometric techniques
- Policy-relevant research questions
- Data-driven insights
Technologies: Jupyter Notebooks, Python, statistical analysis, economic modeling
Forecasting & Predictive Analytics
Forecasting Projects
Predictive Analytics Portfolio
Multiple projects demonstrating expertise in forecasting and predictive modeling:
- Traditional Methods: ARIMA, exponential smoothing, time series analysis
- Machine Learning: Neural networks, gradient boosting (LightGBM)
- Ensemble Methods: Model combination and performance optimization
- Real-world Applications: Business forecasting and decision support
Technologies: R, Python, neural networks, LightGBM, time series analysis
Research Replication
Academic Research Standards
Replication materials for academic research demonstrating:
- Methodological rigor and reproducibility
- LaTeX document preparation
- Academic writing and presentation
- Research transparency and open science practices
Technologies: LaTeX, academic research methods, statistical replication
Technical Tools & Utilities
Stay Awake
Python Utility
Simple Python script to keep computers awake during long-running processes. Demonstrates practical problem-solving and tool development skills.
Technologies: Python, system utilities, automation
Research Philosophy
My projects reflect a commitment to:
- Methodological Rigor: Applying appropriate statistical and econometric methods
- Reproducibility: Open-source code and transparent research practices
- Real-world Relevance: Addressing practical problems with academic rigor
- Technical Excellence: Using state-of-the-art tools and methods
- Interdisciplinary Approach: Combining economics, computer science, and statistics
Future Directions
I continue to work on projects at the intersection of economics and data science, with particular interest in:
- Real-time economic monitoring using big data
- Machine learning applications in policy evaluation
- Advanced econometric methods for causal inference
- Open-source tools for economic research
All projects are available on GitHub with open-source licenses encouraging reuse and collaboration.