2025

March 19, 2025 - Comparing Online Linear Regression and Kalman Filter Approaches to Online Models, plus derivations

2024

November 14, 2024 - Learning RecSys through Papers Vol III- Mixed Negative Sampling + Odds and Ends
October 28, 2024 - Learning RecSys through Papers Vol II- The How, What, and Why of In-Batch Negatives
April 22, 2024 - Learning RecSys through Papers- Implementing a Candidate Generation Model
April 6, 2024 - Calculating Statistical Power When Your Analysis Requires the Delta Method
February 26, 2024 - CUPED with Multiple Covariates and A Simpler the Delta Method Calculation
February 14, 2024 - Connections Between the Delta Method, OLS and CUPED, Illustrated

2018

September 3, 2018 - Extending the Gaussian Mixture Approach for Fantasy Football Tiering
June 21, 2018 - Using Ordinary Differential Equations To Design State of the Art Residual-Style Layers
April 18, 2018 - Learning About Deep Reinforcement Learning (Slides)
March 23, 2018 - Understanding Attention in Neural Networks Mathematically
January 7, 2018 - Adversarial Dreaming with TensorFlow and Keras

2017

November 18, 2017 - Hogwild!? Implementing Async SGD in Python
October 15, 2017 - Covariate Shift, i.e. Why Prediction Quality Can Degrade In Production and How To Fix It
August 27, 2017 - An Annotated Proof of Generative Adversarial Networks with Implementation Notes
July 26, 2017 - A Tour of Gotchas When Implementing Deep Q Networks with Keras and OpenAi Gym
June 18, 2017 - Visualizing the Learning of a Neural Network Geometrically
March 19, 2017 - Dealing with Trends. Combine a Random Walk with a Tree-Based Model to Predict Time Series Data
March 5, 2017 - (My Opinion of) Best Practices for a Data Scientist in Industry
February 26, 2017 - Leveraging Factorization Machines for Wide Sparse Data and Supervised Visualization

2016

December 27, 2016 - Bayesian Hierarchical Modeling Applied to Fantasy Football Projections for Increased Insight and Confidence
September 5, 2016 - Making Fantasy Football Projections Via A Monte Carlo Simulation
June 29, 2016 - Build Your Own Event-Based Backtester in Python
May 30, 2016 - Parsing HTML Tables in Python with BeautifulSoup and pandas
May 8, 2016 - Detect Communities in Your Steam Friends List with the Steam Web API and Graph Theory
May 1, 2016 - Why Blurring an Image is Similar to Warming Your Coffee
April 24, 2016 - On Solving Partial Differential Equations with Brownian Motion in Python
April 17, 2016 - Eigen-vesting IV. Predicting Stock and Portfolio Returns With Bayesian Methods
April 5, 2016 - Train a Neural Network to Play Black Jack with Q Learning
April 1, 2016 - Making a Markov Chain Twitter Bot in Python
March 30, 2016 - Eigen-vesting III. Random Matrix Filtering in Finance
March 28, 2016 - Connect The Dots. Least Squares, Linear Regression, and Bayesian Regression
March 26, 2016 - Don't Solve-- Simulate! Markov Chain Monte Carlo Methods with PyMC3.
March 22, 2016 - Eigen-vesting II. Optimize Your Portfolio With Optimization
March 18, 2016 - Eigen-vesting I. Linear Algebra Can Help You Choose Your Stock Portfolio
March 14, 2016 - How to Use Math to Win at Fantasy Football With a Lineup Optimizer
March 13, 2016 - Blogging with iPython using Jekyll
March 12, 2016 - Jitter, Convolutional Neural Networks, and a Kaggle Framework

Machine Learning

19 March 2025 - Comparing Online Linear Regression and Kalman Filter Approaches to Online Models, plus derivations, A derivation and implementation of Online Linear Regression and Kalman filters to estimate the weights of a linear model online.
14 November 2024 - Learning RecSys through Papers Vol III- Mixed Negative Sampling + Odds and Ends, Another modern-ish implementation of the candidate generation step of a recommender system in PyTorch with a an implementation of Mixed Negative Sampling and a comparion to previous methods in this series of posts.
28 October 2024 - Learning RecSys through Papers Vol II- The How, What, and Why of In-Batch Negatives, Another modern-ish implementation of the candidate generation step of a recommender system in PyTorch with a sketch of the proof of the LogQ correction for in-batch negatives.
22 April 2024 - Learning RecSys through Papers- Implementing a Candidate Generation Model, A modern-ish implementation of the candidate generation step of the "Deep Neural Networks For YouTube Recommendations" by Covington et al. with a discussion of next steps from other papers.
21 June 2018 - Using Ordinary Differential Equations To Design State of the Art Residual-Style Layers, A walkthrough of the theory behind Residual layers with comments on current research.
18 April 2018 - Learning About Deep Reinforcement Learning (Slides), My slides from a talk for Data Philly on Deep Reinforcement Learning.
23 March 2018 - Understanding Attention in Neural Networks Mathematically, Explains the Attention Mechanism's equations and demonstrates them geometrically and probabilistically.
7 January 2018 - Adversarial Dreaming with TensorFlow and Keras, Details a loss function to perform adversarial dreaming in Python.
18 November 2017 - Hogwild!? Implementing Async SGD in Python, Explains the Hogwild! algorithm and walks through an implementation using the multiprocessing library.
15 October 2017 - Covariate Shift, i.e. Why Prediction Quality Can Degrade In Production and How To Fix It, Describes the Kullback-Leibler Importance Estimation Procedure and introduces a python package to use it.
27 August 2017 - An Annotated Proof of Generative Adversarial Networks with Implementation Notes, Exploring gotchas associated with coding your own DQN implementation based on Google DeepMind's Nature paper.
26 July 2017 - A Tour of Gotchas When Implementing Deep Q Networks with Keras and OpenAi Gym, Exploring gotchas associated with coding your own DQN implementation based on Google DeepMind's Nature paper.
18 June 2017 - Visualizing the Learning of a Neural Network Geometrically, Walking through how to visualize the training process of a neural network.
19 March 2017 - Dealing with Trends. Combine a Random Walk with a Tree-Based Model to Predict Time Series Data, Using statistics and machine learning for time series data.
5 March 2017 - (My Opinion of) Best Practices for a Data Scientist in Industry, Giving unsolicited advice to upcoming Data Scientists.
26 February 2017 - Leveraging Factorization Machines for Wide Sparse Data and Supervised Visualization, A nonlinear model and visualization approach great for sparse data.
5 April 2016 - Train a Neural Network to Play Black Jack with Q Learning, It turns out the traditional "stay at 15" is almost the best strategy.
12 March 2016 - Jitter, Convolutional Neural Networks, and a Kaggle Framework, A recipe for approaching Kaggle competitions.

Practical

19 March 2025 - Comparing Online Linear Regression and Kalman Filter Approaches to Online Models, plus derivations, A derivation and implementation of Online Linear Regression and Kalman filters to estimate the weights of a linear model online.
14 November 2024 - Learning RecSys through Papers Vol III- Mixed Negative Sampling + Odds and Ends, Another modern-ish implementation of the candidate generation step of a recommender system in PyTorch with a an implementation of Mixed Negative Sampling and a comparion to previous methods in this series of posts.
28 October 2024 - Learning RecSys through Papers Vol II- The How, What, and Why of In-Batch Negatives, Another modern-ish implementation of the candidate generation step of a recommender system in PyTorch with a sketch of the proof of the LogQ correction for in-batch negatives.
22 April 2024 - Learning RecSys through Papers- Implementing a Candidate Generation Model, A modern-ish implementation of the candidate generation step of the "Deep Neural Networks For YouTube Recommendations" by Covington et al. with a discussion of next steps from other papers.
6 April 2024 - Calculating Statistical Power When Your Analysis Requires the Delta Method, Demonstration of a statistical power calculation when the variable in question requires the use of the delta method-- i.e., it is a ratio metric.
26 February 2024 - CUPED with Multiple Covariates and A Simpler the Delta Method Calculation, Theoretical derivation of the formula for CUPED with multiple covariates and a calculation trick to make the delta method tractable for this situation.
14 February 2024 - Connections Between the Delta Method, OLS and CUPED, Illustrated, Empirical evidence of the equivalence of OLS and CUPED, plus special considerations for page-level metrics.
3 September 2018 - Extending the Gaussian Mixture Approach for Fantasy Football Tiering, An extention to the Gaussian Mixture approach to tiering Players
26 July 2017 - A Tour of Gotchas When Implementing Deep Q Networks with Keras and OpenAi Gym, Exploring gotchas associated with coding your own DQN implementation based on Google DeepMind's Nature paper.
5 March 2017 - (My Opinion of) Best Practices for a Data Scientist in Industry, Giving unsolicited advice to upcoming Data Scientists.
27 December 2016 - Bayesian Hierarchical Modeling Applied to Fantasy Football Projections for Increased Insight and Confidence, How to make fantasy football projections better with Bayesian techniques.
5 September 2016 - Making Fantasy Football Projections Via A Monte Carlo Simulation, How to make fantasy football projections with historic data and Monte Carlo techniques.
29 June 2016 - Build Your Own Event-Based Backtester in Python, Use multiprocessing to speed up your backtesting!
30 May 2016 - Parsing HTML Tables in Python with BeautifulSoup and pandas, How to use BeautifulSoup and pandas to grab data from the web.
8 May 2016 - Detect Communities in Your Steam Friends List with the Steam Web API and Graph Theory, Using discrete math to analyze your Steam friends.
5 April 2016 - Train a Neural Network to Play Black Jack with Q Learning, It turns out the traditional "stay at 15" is almost the best strategy.
1 April 2016 - Making a Markov Chain Twitter Bot in Python, For those of you who want to spam their followers with gibberish.
30 March 2016 - Eigen-vesting III. Random Matrix Filtering in Finance, Part three in a Series on How Math Fits in Modern Portfolio Theory
26 March 2016 - Don't Solve-- Simulate! Markov Chain Monte Carlo Methods with PyMC3., A lightning tour of PyMC3 and Bayesian inference to solve (somtimes frustrating or impossible) pen and paper problems.
22 March 2016 - Eigen-vesting II. Optimize Your Portfolio With Optimization, Part Two in a Series on How Math Fits in Modern Portfolio Theory
18 March 2016 - Eigen-vesting I. Linear Algebra Can Help You Choose Your Stock Portfolio, A Series on How Math Fits in Modern Portfolio Theory
14 March 2016 - How to Use Math to Win at Fantasy Football With a Lineup Optimizer, All the math in the world won't guarantee a win, but it could help.
12 March 2016 - Jitter, Convolutional Neural Networks, and a Kaggle Framework, A recipe for approaching Kaggle competitions.

Blog

13 March 2016 - Blogging with iPython using Jekyll, Converting code to blogs with iPython.

Math

3 September 2018 - Extending the Gaussian Mixture Approach for Fantasy Football Tiering, An extention to the Gaussian Mixture approach to tiering Players
26 February 2017 - Leveraging Factorization Machines for Wide Sparse Data and Supervised Visualization, A nonlinear model and visualization approach great for sparse data.
27 December 2016 - Bayesian Hierarchical Modeling Applied to Fantasy Football Projections for Increased Insight and Confidence, How to make fantasy football projections better with Bayesian techniques.
5 September 2016 - Making Fantasy Football Projections Via A Monte Carlo Simulation, How to make fantasy football projections with historic data and Monte Carlo techniques.
1 May 2016 - Why Blurring an Image is Similar to Warming Your Coffee, Showing the connection between the heat equation and the Gaussian blur.
24 April 2016 - On Solving Partial Differential Equations with Brownian Motion in Python, When random walks solve deterministic equations
17 April 2016 - Eigen-vesting IV. Predicting Stock and Portfolio Returns With Bayesian Methods, Part four in a Series on How Math Fits in Modern Portfolio Theory
1 April 2016 - Making a Markov Chain Twitter Bot in Python, For those of you who want to spam their followers with gibberish.
30 March 2016 - Eigen-vesting III. Random Matrix Filtering in Finance, Part three in a Series on How Math Fits in Modern Portfolio Theory
28 March 2016 - Connect The Dots. Least Squares, Linear Regression, and Bayesian Regression, Sometimes it helps to draw a line or two.
26 March 2016 - Don't Solve-- Simulate! Markov Chain Monte Carlo Methods with PyMC3., A lightning tour of PyMC3 and Bayesian inference to solve (somtimes frustrating or impossible) pen and paper problems.
22 March 2016 - Eigen-vesting II. Optimize Your Portfolio With Optimization, Part Two in a Series on How Math Fits in Modern Portfolio Theory
18 March 2016 - Eigen-vesting I. Linear Algebra Can Help You Choose Your Stock Portfolio, A Series on How Math Fits in Modern Portfolio Theory
14 March 2016 - How to Use Math to Win at Fantasy Football With a Lineup Optimizer, All the math in the world won't guarantee a win, but it could help.

Sports

3 September 2018 - Extending the Gaussian Mixture Approach for Fantasy Football Tiering, An extention to the Gaussian Mixture approach to tiering Players
27 December 2016 - Bayesian Hierarchical Modeling Applied to Fantasy Football Projections for Increased Insight and Confidence, How to make fantasy football projections better with Bayesian techniques.
5 September 2016 - Making Fantasy Football Projections Via A Monte Carlo Simulation, How to make fantasy football projections with historic data and Monte Carlo techniques.
14 March 2016 - How to Use Math to Win at Fantasy Football With a Lineup Optimizer, All the math in the world won't guarantee a win, but it could help.

Finance

29 June 2016 - Build Your Own Event-Based Backtester in Python, Use multiprocessing to speed up your backtesting!
17 April 2016 - Eigen-vesting IV. Predicting Stock and Portfolio Returns With Bayesian Methods, Part four in a Series on How Math Fits in Modern Portfolio Theory
30 March 2016 - Eigen-vesting III. Random Matrix Filtering in Finance, Part three in a Series on How Math Fits in Modern Portfolio Theory
22 March 2016 - Eigen-vesting II. Optimize Your Portfolio With Optimization, Part Two in a Series on How Math Fits in Modern Portfolio Theory
18 March 2016 - Eigen-vesting I. Linear Algebra Can Help You Choose Your Stock Portfolio, A Series on How Math Fits in Modern Portfolio Theory

Theory

21 June 2018 - Using Ordinary Differential Equations To Design State of the Art Residual-Style Layers, A walkthrough of the theory behind Residual layers with comments on current research.
18 April 2018 - Learning About Deep Reinforcement Learning (Slides), My slides from a talk for Data Philly on Deep Reinforcement Learning.
23 March 2018 - Understanding Attention in Neural Networks Mathematically, Explains the Attention Mechanism's equations and demonstrates them geometrically and probabilistically.
7 January 2018 - Adversarial Dreaming with TensorFlow and Keras, Details a loss function to perform adversarial dreaming in Python.
18 November 2017 - Hogwild!? Implementing Async SGD in Python, Explains the Hogwild! algorithm and walks through an implementation using the multiprocessing library.
15 October 2017 - Covariate Shift, i.e. Why Prediction Quality Can Degrade In Production and How To Fix It, Describes the Kullback-Leibler Importance Estimation Procedure and introduces a python package to use it.
27 August 2017 - An Annotated Proof of Generative Adversarial Networks with Implementation Notes, Exploring gotchas associated with coding your own DQN implementation based on Google DeepMind's Nature paper.
18 June 2017 - Visualizing the Learning of a Neural Network Geometrically, Walking through how to visualize the training process of a neural network.
19 March 2017 - Dealing with Trends. Combine a Random Walk with a Tree-Based Model to Predict Time Series Data, Using statistics and machine learning for time series data.
26 February 2017 - Leveraging Factorization Machines for Wide Sparse Data and Supervised Visualization, A nonlinear model and visualization approach great for sparse data.
1 May 2016 - Why Blurring an Image is Similar to Warming Your Coffee, Showing the connection between the heat equation and the Gaussian blur.
24 April 2016 - On Solving Partial Differential Equations with Brownian Motion in Python, When random walks solve deterministic equations
17 April 2016 - Eigen-vesting IV. Predicting Stock and Portfolio Returns With Bayesian Methods, Part four in a Series on How Math Fits in Modern Portfolio Theory
5 April 2016 - Train a Neural Network to Play Black Jack with Q Learning, It turns out the traditional "stay at 15" is almost the best strategy.
1 April 2016 - Making a Markov Chain Twitter Bot in Python, For those of you who want to spam their followers with gibberish.
30 March 2016 - Eigen-vesting III. Random Matrix Filtering in Finance, Part three in a Series on How Math Fits in Modern Portfolio Theory
28 March 2016 - Connect The Dots. Least Squares, Linear Regression, and Bayesian Regression, Sometimes it helps to draw a line or two.
26 March 2016 - Don't Solve-- Simulate! Markov Chain Monte Carlo Methods with PyMC3., A lightning tour of PyMC3 and Bayesian inference to solve (somtimes frustrating or impossible) pen and paper problems.
22 March 2016 - Eigen-vesting II. Optimize Your Portfolio With Optimization, Part Two in a Series on How Math Fits in Modern Portfolio Theory
18 March 2016 - Eigen-vesting I. Linear Algebra Can Help You Choose Your Stock Portfolio, A Series on How Math Fits in Modern Portfolio Theory

Statistics

6 April 2024 - Calculating Statistical Power When Your Analysis Requires the Delta Method, Demonstration of a statistical power calculation when the variable in question requires the use of the delta method-- i.e., it is a ratio metric.
26 February 2024 - CUPED with Multiple Covariates and A Simpler the Delta Method Calculation, Theoretical derivation of the formula for CUPED with multiple covariates and a calculation trick to make the delta method tractable for this situation.
14 February 2024 - Connections Between the Delta Method, OLS and CUPED, Illustrated, Empirical evidence of the equivalence of OLS and CUPED, plus special considerations for page-level metrics.
19 March 2017 - Dealing with Trends. Combine a Random Walk with a Tree-Based Model to Predict Time Series Data, Using statistics and machine learning for time series data.
27 December 2016 - Bayesian Hierarchical Modeling Applied to Fantasy Football Projections for Increased Insight and Confidence, How to make fantasy football projections better with Bayesian techniques.
5 September 2016 - Making Fantasy Football Projections Via A Monte Carlo Simulation, How to make fantasy football projections with historic data and Monte Carlo techniques.
17 April 2016 - Eigen-vesting IV. Predicting Stock and Portfolio Returns With Bayesian Methods, Part four in a Series on How Math Fits in Modern Portfolio Theory
1 April 2016 - Making a Markov Chain Twitter Bot in Python, For those of you who want to spam their followers with gibberish.
26 March 2016 - Don't Solve-- Simulate! Markov Chain Monte Carlo Methods with PyMC3., A lightning tour of PyMC3 and Bayesian inference to solve (somtimes frustrating or impossible) pen and paper problems.

Reinforcement Learning

26 July 2017 - A Tour of Gotchas When Implementing Deep Q Networks with Keras and OpenAi Gym, Exploring gotchas associated with coding your own DQN implementation based on Google DeepMind's Nature paper.
5 April 2016 - Train a Neural Network to Play Black Jack with Q Learning, It turns out the traditional "stay at 15" is almost the best strategy.

Python

1 May 2016 - Why Blurring an Image is Similar to Warming Your Coffee, Showing the connection between the heat equation and the Gaussian blur.

Data

5 March 2017 - (My Opinion of) Best Practices for a Data Scientist in Industry, Giving unsolicited advice to upcoming Data Scientists.
30 May 2016 - Parsing HTML Tables in Python with BeautifulSoup and pandas, How to use BeautifulSoup and pandas to grab data from the web.
8 May 2016 - Detect Communities in Your Steam Friends List with the Steam Web API and Graph Theory, Using discrete math to analyze your Steam friends.

A/B Testing

6 April 2024 - Calculating Statistical Power When Your Analysis Requires the Delta Method, Demonstration of a statistical power calculation when the variable in question requires the use of the delta method-- i.e., it is a ratio metric.
26 February 2024 - CUPED with Multiple Covariates and A Simpler the Delta Method Calculation, Theoretical derivation of the formula for CUPED with multiple covariates and a calculation trick to make the delta method tractable for this situation.
14 February 2024 - Connections Between the Delta Method, OLS and CUPED, Illustrated, Empirical evidence of the equivalence of OLS and CUPED, plus special considerations for page-level metrics.

Recommendations

14 November 2024 - Learning RecSys through Papers Vol III- Mixed Negative Sampling + Odds and Ends, Another modern-ish implementation of the candidate generation step of a recommender system in PyTorch with a an implementation of Mixed Negative Sampling and a comparion to previous methods in this series of posts.
28 October 2024 - Learning RecSys through Papers Vol II- The How, What, and Why of In-Batch Negatives, Another modern-ish implementation of the candidate generation step of a recommender system in PyTorch with a sketch of the proof of the LogQ correction for in-batch negatives.
22 April 2024 - Learning RecSys through Papers- Implementing a Candidate Generation Model, A modern-ish implementation of the candidate generation step of the "Deep Neural Networks For YouTube Recommendations" by Covington et al. with a discussion of next steps from other papers.

RecSys

14 November 2024 - Learning RecSys through Papers Vol III- Mixed Negative Sampling + Odds and Ends, Another modern-ish implementation of the candidate generation step of a recommender system in PyTorch with a an implementation of Mixed Negative Sampling and a comparion to previous methods in this series of posts.
28 October 2024 - Learning RecSys through Papers Vol II- The How, What, and Why of In-Batch Negatives, Another modern-ish implementation of the candidate generation step of a recommender system in PyTorch with a sketch of the proof of the LogQ correction for in-batch negatives.
22 April 2024 - Learning RecSys through Papers- Implementing a Candidate Generation Model, A modern-ish implementation of the candidate generation step of the "Deep Neural Networks For YouTube Recommendations" by Covington et al. with a discussion of next steps from other papers.

Online

19 March 2025 - Comparing Online Linear Regression and Kalman Filter Approaches to Online Models, plus derivations, A derivation and implementation of Online Linear Regression and Kalman filters to estimate the weights of a linear model online.

Scott Rome

Archive

All posts

2025

2024

2018

2017

2016

Machine Learning

Practical

Blog

Math

Sports

Finance

Theory

Statistics

Reinforcement Learning

Python

Data

A/B Testing

Recommendations

RecSys

Online