Hey Google! will New Zealand vote to legalise cannabis? Using Google Trends data to predict the outcome of the 2020 New Zealand cannabis referendum.
Int J Drug Policy. 2020 Dec 17;90:103083
Authors: Raubenheimer JE, Riordan BC, Merrill JE, Winter T, Ward RM, Scarf D, Buckley NA
BACKGROUND: New Zealand held a referendum on the legalisation of recreational cannabis in October 2020. Polls preceding the referendum provided contrasting outcomes. We investigated whether internet search data from Google Trends could provide an alternative estimate of the referendum outcome.
METHODS: We assessed various methods for accessing Google Trends data, downloading search probability data for google.com searches from New Zealand via trends.google.com, PyTrends and Google Trends Extended for Health. We used daily data for the three months prior to the final referendum date, and hourly data for the final week. We defined two smaller time frames each from daily and hourly data, allowing comparisons over the entire time frames, and progressively closer to the end. Using the selected keyword combination of ‘cannabis referendum yes/no’ we calculated the proportions of ‘yes’ and ‘no’ searches for each time frame/data source combination, aiming for a prediction within 2% of the final result.
RESULTS: Data from different sources varied slightly. The method used to aggregate search probabilities over the selected time frame (mean/median) resulted in changes in the predicted outcome for hourly-, but not daily data. On 20 October we predicted the ‘no’ vote at 51.9%-55.4% for daily-, and 60% for hourly data when aggregated using the median, but only 49% for mean hourly data. Hourly data performed poorly at predicting the final 51.2% ‘no’ result, while predictions based on mean daily data for the full voting period provided the best prediction, differing by 0.1-0.2%.
CONCLUSION: Predictions based on Google Trends data broadly agreed with polling predictions, but the exact method used affected the eventual prediction. While polls are subject to influence from methodological considerations (e.g., sampling), it is clear that Google Trends data can be used to make a prediction, but do not present a magic bullet solution to polling problems.
PMID: 33341700 [PubMed – as supplied by publisher]
Source: ncbi 2