All posts
Machine Learning
- - Learning RecSys through Papers Vol III- Mixed Negative Sampling + Odds and Ends, Another modern-ish implementation of the candidate generation step of a recommender system in PyTorch with a an implementation of Mixed Negative Sampling and a comparion to previous methods in this series of posts.
- - Learning RecSys through Papers Vol II- The How, What, and Why of In-Batch Negatives, Another modern-ish implementation of the candidate generation step of a recommender system in PyTorch with a sketch of the proof of the LogQ correction for in-batch negatives.
- - Learning RecSys through Papers- Implementing a Candidate Generation Model, A modern-ish implementation of the candidate generation step of the "Deep Neural Networks For YouTube Recommendations" by Covington et al. with a discussion of next steps from other papers.
- - Using Ordinary Differential Equations To Design State of the Art Residual-Style Layers, A walkthrough of the theory behind Residual layers with comments on current research.
- - Learning About Deep Reinforcement Learning (Slides), My slides from a talk for Data Philly on Deep Reinforcement Learning.
- - Understanding Attention in Neural Networks Mathematically, Explains the Attention Mechanism's equations and demonstrates them geometrically and probabilistically.
- - Adversarial Dreaming with TensorFlow and Keras, Details a loss function to perform adversarial dreaming in Python.
- - Hogwild!? Implementing Async SGD in Python, Explains the Hogwild! algorithm and walks through an implementation using the multiprocessing library.
- - Covariate Shift, i.e. Why Prediction Quality Can Degrade In Production and How To Fix It, Describes the Kullback-Leibler Importance Estimation Procedure and introduces a python package to use it.
- - An Annotated Proof of Generative Adversarial Networks with Implementation Notes, Exploring gotchas associated with coding your own DQN implementation based on Google DeepMind's Nature paper.
- - A Tour of Gotchas When Implementing Deep Q Networks with Keras and OpenAi Gym, Exploring gotchas associated with coding your own DQN implementation based on Google DeepMind's Nature paper.
- - Visualizing the Learning of a Neural Network Geometrically, Walking through how to visualize the training process of a neural network.
- - Dealing with Trends. Combine a Random Walk with a Tree-Based Model to Predict Time Series Data, Using statistics and machine learning for time series data.
- - (My Opinion of) Best Practices for a Data Scientist in Industry, Giving unsolicited advice to upcoming Data Scientists.
- - Leveraging Factorization Machines for Wide Sparse Data and Supervised Visualization, A nonlinear model and visualization approach great for sparse data.
- - Train a Neural Network to Play Black Jack with Q Learning, It turns out the traditional "stay at 15" is almost the best strategy.
- - Jitter, Convolutional Neural Networks, and a Kaggle Framework, A recipe for approaching Kaggle competitions.
Practical
- - Learning RecSys through Papers Vol III- Mixed Negative Sampling + Odds and Ends, Another modern-ish implementation of the candidate generation step of a recommender system in PyTorch with a an implementation of Mixed Negative Sampling and a comparion to previous methods in this series of posts.
- - Learning RecSys through Papers Vol II- The How, What, and Why of In-Batch Negatives, Another modern-ish implementation of the candidate generation step of a recommender system in PyTorch with a sketch of the proof of the LogQ correction for in-batch negatives.
- - Learning RecSys through Papers- Implementing a Candidate Generation Model, A modern-ish implementation of the candidate generation step of the "Deep Neural Networks For YouTube Recommendations" by Covington et al. with a discussion of next steps from other papers.
- - Calculating Statistical Power When Your Analysis Requires the Delta Method, Demonstration of a statistical power calculation when the variable in question requires the use of the delta method-- i.e., it is a ratio metric.
- - CUPED with Multiple Covariates and A Simpler the Delta Method Calculation, Theoretical derivation of the formula for CUPED with multiple covariates and a calculation trick to make the delta method tractable for this situation.
- - Connections Between the Delta Method, OLS and CUPED, Illustrated, Empirical evidence of the equivalence of OLS and CUPED, plus special considerations for page-level metrics.
- - Extending the Gaussian Mixture Approach for Fantasy Football Tiering, An extention to the Gaussian Mixture approach to tiering Players
- - A Tour of Gotchas When Implementing Deep Q Networks with Keras and OpenAi Gym, Exploring gotchas associated with coding your own DQN implementation based on Google DeepMind's Nature paper.
- - (My Opinion of) Best Practices for a Data Scientist in Industry, Giving unsolicited advice to upcoming Data Scientists.
- - Bayesian Hierarchical Modeling Applied to Fantasy Football Projections for Increased Insight and Confidence, How to make fantasy football projections better with Bayesian techniques.
- - Making Fantasy Football Projections Via A Monte Carlo Simulation, How to make fantasy football projections with historic data and Monte Carlo techniques.
- - Build Your Own Event-Based Backtester in Python, Use multiprocessing to speed up your backtesting!
- - Parsing HTML Tables in Python with BeautifulSoup and pandas, How to use BeautifulSoup and pandas to grab data from the web.
- - Detect Communities in Your Steam Friends List with the Steam Web API and Graph Theory, Using discrete math to analyze your Steam friends.
- - Train a Neural Network to Play Black Jack with Q Learning, It turns out the traditional "stay at 15" is almost the best strategy.
- - Making a Markov Chain Twitter Bot in Python, For those of you who want to spam their followers with gibberish.
- - Eigen-vesting III. Random Matrix Filtering in Finance, Part three in a Series on How Math Fits in Modern Portfolio Theory
- - Don't Solve-- Simulate! Markov Chain Monte Carlo Methods with PyMC3., A lightning tour of PyMC3 and Bayesian inference to solve (somtimes frustrating or impossible) pen and paper problems.
- - Eigen-vesting II. Optimize Your Portfolio With Optimization, Part Two in a Series on How Math Fits in Modern Portfolio Theory
- - Eigen-vesting I. Linear Algebra Can Help You Choose Your Stock Portfolio, A Series on How Math Fits in Modern Portfolio Theory
- - How to Use Math to Win at Fantasy Football With a Lineup Optimizer, All the math in the world won't guarantee a win, but it could help.
- - Jitter, Convolutional Neural Networks, and a Kaggle Framework, A recipe for approaching Kaggle competitions.
Blog
- - Blogging with iPython using Jekyll, Converting code to blogs with iPython.
Math
- - Extending the Gaussian Mixture Approach for Fantasy Football Tiering, An extention to the Gaussian Mixture approach to tiering Players
- - Leveraging Factorization Machines for Wide Sparse Data and Supervised Visualization, A nonlinear model and visualization approach great for sparse data.
- - Bayesian Hierarchical Modeling Applied to Fantasy Football Projections for Increased Insight and Confidence, How to make fantasy football projections better with Bayesian techniques.
- - Making Fantasy Football Projections Via A Monte Carlo Simulation, How to make fantasy football projections with historic data and Monte Carlo techniques.
- - Why Blurring an Image is Similar to Warming Your Coffee, Showing the connection between the heat equation and the Gaussian blur.
- - On Solving Partial Differential Equations with Brownian Motion in Python, When random walks solve deterministic equations
- - Eigen-vesting IV. Predicting Stock and Portfolio Returns With Bayesian Methods, Part four in a Series on How Math Fits in Modern Portfolio Theory
- - Making a Markov Chain Twitter Bot in Python, For those of you who want to spam their followers with gibberish.
- - Eigen-vesting III. Random Matrix Filtering in Finance, Part three in a Series on How Math Fits in Modern Portfolio Theory
- - Connect The Dots. Least Squares, Linear Regression, and Bayesian Regression, Sometimes it helps to draw a line or two.
- - Don't Solve-- Simulate! Markov Chain Monte Carlo Methods with PyMC3., A lightning tour of PyMC3 and Bayesian inference to solve (somtimes frustrating or impossible) pen and paper problems.
- - Eigen-vesting II. Optimize Your Portfolio With Optimization, Part Two in a Series on How Math Fits in Modern Portfolio Theory
- - Eigen-vesting I. Linear Algebra Can Help You Choose Your Stock Portfolio, A Series on How Math Fits in Modern Portfolio Theory
- - How to Use Math to Win at Fantasy Football With a Lineup Optimizer, All the math in the world won't guarantee a win, but it could help.
Sports
- - Extending the Gaussian Mixture Approach for Fantasy Football Tiering, An extention to the Gaussian Mixture approach to tiering Players
- - Bayesian Hierarchical Modeling Applied to Fantasy Football Projections for Increased Insight and Confidence, How to make fantasy football projections better with Bayesian techniques.
- - Making Fantasy Football Projections Via A Monte Carlo Simulation, How to make fantasy football projections with historic data and Monte Carlo techniques.
- - How to Use Math to Win at Fantasy Football With a Lineup Optimizer, All the math in the world won't guarantee a win, but it could help.
Finance
- - Build Your Own Event-Based Backtester in Python, Use multiprocessing to speed up your backtesting!
- - Eigen-vesting IV. Predicting Stock and Portfolio Returns With Bayesian Methods, Part four in a Series on How Math Fits in Modern Portfolio Theory
- - Eigen-vesting III. Random Matrix Filtering in Finance, Part three in a Series on How Math Fits in Modern Portfolio Theory
- - Eigen-vesting II. Optimize Your Portfolio With Optimization, Part Two in a Series on How Math Fits in Modern Portfolio Theory
- - Eigen-vesting I. Linear Algebra Can Help You Choose Your Stock Portfolio, A Series on How Math Fits in Modern Portfolio Theory
Theory
- - Using Ordinary Differential Equations To Design State of the Art Residual-Style Layers, A walkthrough of the theory behind Residual layers with comments on current research.
- - Learning About Deep Reinforcement Learning (Slides), My slides from a talk for Data Philly on Deep Reinforcement Learning.
- - Understanding Attention in Neural Networks Mathematically, Explains the Attention Mechanism's equations and demonstrates them geometrically and probabilistically.
- - Adversarial Dreaming with TensorFlow and Keras, Details a loss function to perform adversarial dreaming in Python.
- - Hogwild!? Implementing Async SGD in Python, Explains the Hogwild! algorithm and walks through an implementation using the multiprocessing library.
- - Covariate Shift, i.e. Why Prediction Quality Can Degrade In Production and How To Fix It, Describes the Kullback-Leibler Importance Estimation Procedure and introduces a python package to use it.
- - An Annotated Proof of Generative Adversarial Networks with Implementation Notes, Exploring gotchas associated with coding your own DQN implementation based on Google DeepMind's Nature paper.
- - Visualizing the Learning of a Neural Network Geometrically, Walking through how to visualize the training process of a neural network.
- - Dealing with Trends. Combine a Random Walk with a Tree-Based Model to Predict Time Series Data, Using statistics and machine learning for time series data.
- - Leveraging Factorization Machines for Wide Sparse Data and Supervised Visualization, A nonlinear model and visualization approach great for sparse data.
- - Why Blurring an Image is Similar to Warming Your Coffee, Showing the connection between the heat equation and the Gaussian blur.
- - On Solving Partial Differential Equations with Brownian Motion in Python, When random walks solve deterministic equations
- - Eigen-vesting IV. Predicting Stock and Portfolio Returns With Bayesian Methods, Part four in a Series on How Math Fits in Modern Portfolio Theory
- - Train a Neural Network to Play Black Jack with Q Learning, It turns out the traditional "stay at 15" is almost the best strategy.
- - Making a Markov Chain Twitter Bot in Python, For those of you who want to spam their followers with gibberish.
- - Eigen-vesting III. Random Matrix Filtering in Finance, Part three in a Series on How Math Fits in Modern Portfolio Theory
- - Connect The Dots. Least Squares, Linear Regression, and Bayesian Regression, Sometimes it helps to draw a line or two.
- - Don't Solve-- Simulate! Markov Chain Monte Carlo Methods with PyMC3., A lightning tour of PyMC3 and Bayesian inference to solve (somtimes frustrating or impossible) pen and paper problems.
- - Eigen-vesting II. Optimize Your Portfolio With Optimization, Part Two in a Series on How Math Fits in Modern Portfolio Theory
- - Eigen-vesting I. Linear Algebra Can Help You Choose Your Stock Portfolio, A Series on How Math Fits in Modern Portfolio Theory
Statistics
- - Calculating Statistical Power When Your Analysis Requires the Delta Method, Demonstration of a statistical power calculation when the variable in question requires the use of the delta method-- i.e., it is a ratio metric.
- - CUPED with Multiple Covariates and A Simpler the Delta Method Calculation, Theoretical derivation of the formula for CUPED with multiple covariates and a calculation trick to make the delta method tractable for this situation.
- - Connections Between the Delta Method, OLS and CUPED, Illustrated, Empirical evidence of the equivalence of OLS and CUPED, plus special considerations for page-level metrics.
- - Dealing with Trends. Combine a Random Walk with a Tree-Based Model to Predict Time Series Data, Using statistics and machine learning for time series data.
- - Bayesian Hierarchical Modeling Applied to Fantasy Football Projections for Increased Insight and Confidence, How to make fantasy football projections better with Bayesian techniques.
- - Making Fantasy Football Projections Via A Monte Carlo Simulation, How to make fantasy football projections with historic data and Monte Carlo techniques.
- - Eigen-vesting IV. Predicting Stock and Portfolio Returns With Bayesian Methods, Part four in a Series on How Math Fits in Modern Portfolio Theory
- - Making a Markov Chain Twitter Bot in Python, For those of you who want to spam their followers with gibberish.
- - Don't Solve-- Simulate! Markov Chain Monte Carlo Methods with PyMC3., A lightning tour of PyMC3 and Bayesian inference to solve (somtimes frustrating or impossible) pen and paper problems.
Reinforcement Learning
- - A Tour of Gotchas When Implementing Deep Q Networks with Keras and OpenAi Gym, Exploring gotchas associated with coding your own DQN implementation based on Google DeepMind's Nature paper.
- - Train a Neural Network to Play Black Jack with Q Learning, It turns out the traditional "stay at 15" is almost the best strategy.
Python
- - Why Blurring an Image is Similar to Warming Your Coffee, Showing the connection between the heat equation and the Gaussian blur.
Data
- - (My Opinion of) Best Practices for a Data Scientist in Industry, Giving unsolicited advice to upcoming Data Scientists.
- - Parsing HTML Tables in Python with BeautifulSoup and pandas, How to use BeautifulSoup and pandas to grab data from the web.
- - Detect Communities in Your Steam Friends List with the Steam Web API and Graph Theory, Using discrete math to analyze your Steam friends.
A/B Testing
- - Calculating Statistical Power When Your Analysis Requires the Delta Method, Demonstration of a statistical power calculation when the variable in question requires the use of the delta method-- i.e., it is a ratio metric.
- - CUPED with Multiple Covariates and A Simpler the Delta Method Calculation, Theoretical derivation of the formula for CUPED with multiple covariates and a calculation trick to make the delta method tractable for this situation.
- - Connections Between the Delta Method, OLS and CUPED, Illustrated, Empirical evidence of the equivalence of OLS and CUPED, plus special considerations for page-level metrics.
Recommendations
- - Learning RecSys through Papers Vol III- Mixed Negative Sampling + Odds and Ends, Another modern-ish implementation of the candidate generation step of a recommender system in PyTorch with a an implementation of Mixed Negative Sampling and a comparion to previous methods in this series of posts.
- - Learning RecSys through Papers Vol II- The How, What, and Why of In-Batch Negatives, Another modern-ish implementation of the candidate generation step of a recommender system in PyTorch with a sketch of the proof of the LogQ correction for in-batch negatives.
- - Learning RecSys through Papers- Implementing a Candidate Generation Model, A modern-ish implementation of the candidate generation step of the "Deep Neural Networks For YouTube Recommendations" by Covington et al. with a discussion of next steps from other papers.
RecSys
- - Learning RecSys through Papers Vol III- Mixed Negative Sampling + Odds and Ends, Another modern-ish implementation of the candidate generation step of a recommender system in PyTorch with a an implementation of Mixed Negative Sampling and a comparion to previous methods in this series of posts.
- - Learning RecSys through Papers Vol II- The How, What, and Why of In-Batch Negatives, Another modern-ish implementation of the candidate generation step of a recommender system in PyTorch with a sketch of the proof of the LogQ correction for in-batch negatives.
- - Learning RecSys through Papers- Implementing a Candidate Generation Model, A modern-ish implementation of the candidate generation step of the "Deep Neural Networks For YouTube Recommendations" by Covington et al. with a discussion of next steps from other papers.