Dataset

randomization in online experiments

Most scientists consider randomized experiments to be the best method available to establish causality. On the Internet, during the past twenty-five years, randomized experiments have become common, often referred to as A/B testing. For practical reasons, much A/B testing does not use pseudo-random number generators to implement randomization. Instead, hash functions are used to transform the distribution of identifiers of experimental units into a uniform distribution. Using two large, industry data sets, I demonstrate that the success of hash-based quasi-randomization strategies depends greatly on the hash function used: MD5 yielded good results, while SHA512 yielded less impressive ones.

Data and Resources

code_and_data.zip ZIP Popular
How to use the code and data to replicate paper results: Inside the archive...
Downloads: 56
explore
- More information
- Download

Suggested Citation

Golyaev, Konstantin (2018): Randomization in Online Experiments. Version: 1. Journal of Economics and Statistics. Dataset. https://doi.org/10.15456/jbnst.2018192.235844

Related Publication

Golyaev, K. (2018). Randomization in Online Experiments. Jahrbücher für Nationalökonomie und Statistik, 238(3-4). doi: 10.1515/jbnst-2018-0006

randomization in online experiments

Data and Resources

Suggested Citation

Related Publication

Tags

JEL Codes