Parsing USPTO Patents to Create a Massive Free Labeled Dataset

Arguably one of my favorite (and best) labeled text datasets are patents at the United States Patent and Trademark Office (USPTO). Every patent is freely available with labeled images, abstract, claims, a long description, authors, dates, classification labels, etc. Data in the provided format can be used for a lot of natural language processing (NLP) […]

Backtesting Our 100% YoY Profit Generating Strategy

This is a follow-up of the How I Made Over 100% YoY Profit Investing article, since I felt it was prudent to add some additional details. Extraordinary claims require extraordinary evidence – Carl Sagan Specifically, this article is on the backtesting results of the rudimentary (but still effective) algorithm currently on ProjectPiglet.com. Although back-testing has […]

Neural Networks to Production, From an Engineer

Deep learning is a pretty hot topic. Unfortunately, the vast majority of engineers and data scientists don’t have practical experience. However, you’re in luck! I wrote a guide on solving a problem from start to finish using neural networks, the same way I would approach and solve the problem. Who am I, you ask? I’m […]