8-K SEC Filing Scraper
Work done during a Summer 2023 machine learning engineering internship with Intrinio.
Project overview:
- Designed, developed, and tested an RNN/LSTM model that predicts key financial metrics (e.g., EBITDA), achieving 70% accuracy on real market data. Built with pandas, NumPy, and PyTorch.
- Built an end-to-end data pipeline with pandas and Beautiful Soup that automatically parses 8-K SEC filings, then extracts and standardizes their contents, yielding new data on over 1,000 publicly traded companies.
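
To give a rough idea of the parsing step in the pipeline, here is a minimal sketch that pulls item headings out of an 8-K filing's HTML and collects them in a DataFrame. It is an illustration only, not the project's actual code: the `extract_items` function, the `ITEM_PATTERN` regex, the column names, and the sample HTML are all hypothetical, and real filings vary enough in markup that a production parser would need more handling.

```python
import re

import pandas as pd
from bs4 import BeautifulSoup

# Hypothetical pattern for 8-K item headings, e.g.
# "Item 2.02 Results of Operations and Financial Condition."
ITEM_PATTERN = re.compile(r"^Item\s+(\d+\.\d{2})\.?\s*(.*)$", re.IGNORECASE)


def extract_items(html: str, company: str) -> pd.DataFrame:
    """Return one row per item heading found in the filing HTML."""
    soup = BeautifulSoup(html, "html.parser")
    rows = []
    # Joining text nodes with newlines puts each heading on its own line.
    for line in soup.get_text("\n").splitlines():
        match = ITEM_PATTERN.match(line.strip())
        if match:
            rows.append(
                {
                    "company": company,
                    "item": match.group(1),
                    # Drop the trailing period so titles are standardized.
                    "title": match.group(2).strip().rstrip("."),
                }
            )
    return pd.DataFrame(rows, columns=["company", "item", "title"])


# Made-up sample filing fragment, for illustration only.
SAMPLE_HTML = """
<html><body>
<p><b>Item 2.02 Results of Operations and Financial Condition.</b></p>
<p>On July 27, 2023, the Company announced its results.</p>
<p><b>Item 9.01 Financial Statements and Exhibits.</b></p>
</body></html>
"""

items = extract_items(SAMPLE_HTML, company="EXAMPLE CORP")
print(items)
```

Returning a DataFrame with fixed columns keeps the output uniform across filings, so results from many companies can be concatenated with `pd.concat` before further standardization.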