Data4Policy.org

Data Science and Big Data in the Public Sector

About

Data4policy.org is an evolving web collection of articles and research papers about using data science and big data in the public sector. Data4policy.org is based at the University of Oxford and sponsored by the Cyber Studies Programme, currently maintained by Innar Liiv (Associate Professor, Tallinn University of Technology & Visiting Research Fellow, University of Oxford).

Data for Policy

ArticleYear
Kim, Gang-Hoon Trimi, Silvana Chung, Ji-Hyong, "Big-data applications in the government sector", pages 78-85, 20142014
Poel, Martijn Schroeder, Ralph Blackman, Colin, "Data for Policy: A study of big data and other innovative data-driven approaches for evidence-informed policymaking", 20152015
Zarsky, Tal Z, "Governmental data mining and its alternatives", HeinOnline, pages 285, 20112011
Rajagopalan, MR Vellaipandiyan, Solaimurugan, "Big data framework for national E-governance plan", ICT and Knowledge Engineering (ICT\&KE), 2013 11th International Conference on, pages 1-5, 20132013
Attard, Judie Orlandi, Fabrizio Scerri, Simon Auer, Soren, "A systematic review of open government data initiatives", Elsevier, pages 399--418, 20152015
Yiu, Chris, "The big data opportunity: Making government faster, smarter and more personal", Policy Exchange, 20122012
Williamson, Andy, "Big Data and the Implications for Government", Cambridge Univ Press, pages 253--257, 20142014
Cate, Fred H, "Government data mining: The need for a legal framework", 20082008
Slobogin, Christopher, "Government data mining and the fourth amendment", JSTOR, pages 317--341, 20082008
Morabito, Vincenzo, "Big Data and Analytics for Government Innovation", Big Data and Analytics, Springer, pages 23--45, 20152015
Munne, Ricard, "Big Data in the Public Sector", New Horizons for a Data-Driven Economy, Springer, pages 195--208, 20162016
Romijn, JH, "Using Big Data in the Public Sector. Uncertainties and Readiness in the Dutch Public Executive Sector", 20142014
Bertot, John Carlo Choi, Heeyoon, "Big data and e-government: issues, policies, and recommendations", Proceedings of the 14th Annual International Conference on Digital Government Research, pages 1--10, 20132013
Minow, Newton Cate, Fred H, "Government Data Mining", Mcgraw-Hill Handbook of Homeland Security, 20082008

Novel Data Sources

ArticleYear
Chunara, Rumi Andrews, Jason R Brownstein, John S, "Social and news media enable estimation of epidemiological patterns early in the 2010 Haitian cholera outbreak", ASTMH, pages 39--45, 20122012
Yuan, Qingyu Nsoesie, Elaine O Lv, Benfu Peng, Geng Chunara, Rumi Brownstein, John S, "Monitoring influenza epidemics in china with search query from baidu", Public Library of Science, pages e64323, 20132013

Policy for Data

ArticleYear
Janssen, Marijn van den Hoven, Jeroen, "Big and Open Linked Data (BOLD) in government: A challenge to transparency and privacy?", pages 363368, 20152015
Hu, Margaret, "Small Data Surveillance v. Big Data Cybersurveillance", pages 773-844, 20152015
Taylor, Nick, "To find the needle do you need the whole haystack? Global surveillance and principled regulation", Taylor \& Francis, pages 45--67, 20142014
Joh, Elizabeth E, "The New Surveillance Discretion: Automated Suspicion, Big Data, and Policing", 20152015

Agriculture, Forestry and Rural Development

ArticleYear
Mucherino, Antonio Papajorgji, Petraq Pardalos, Panos M, "A survey of data mining techniques applied to agriculture", Springer, pages 121--140, 20092009

Education and Culture

ArticleYear
Lakkaraju, Himabindu Aguiar, Everaldo Shan, Carl Miller, David Bhanpuri, Nasir Ghani, Rayid Addison, Kecia L, "A Machine Learning Framework to Identify Students at Risk of Adverse Academic Outcomes", Proceedings of the 21th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pages 1909--1918, 20152015

Energy

ArticleYear
Mitra, Rajendu Kota, Ramachandra Bandyopadhyay, Sambaran Arya, Vijay Sullivan, Brian Mueller, Richard Storey, Heather Labut, Gerard, "Voltage Correlations in Smart Meter Data", Proceedings of the 21th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pages 1999--2008, 20152015
Fei, Hongliang Kim, Younghun Sahu, Sambit Naphade, Milind Mamidipalli, Sanjay K Hutchinson, John, "Heat pump detection from coarse grained smart meter data with positive and unlabeled learning", Proceedings of the 19th ACM SIGKDD international conference on Knowledge discovery and data mining, pages 1330--1338, 20132013

Environment

ArticleYear
Zhang, Y. Yi, Xiuwen Li, Ming Li, Ruiyuan Shan, Zhangqing Chang, Eric Li, Tianrui, "Forecasting fine-grained air quality based on big data", Proceedings of the 21th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pages 2267--2276, 20152015
Wang, Dawei Ding, Wei Yu, Kui Wu, Xindong Chen, Ping Small, David L Islam, Shafiqul, "Towards long-lead forecasting of extreme flood events: a data mining framework for precipitation cluster precursors identification", Proceedings of the 19th ACM SIGKDD international conference on Knowledge discovery and data mining, pages 1285--1293, 20132013

Foreign Affairs

ArticleYear
Strauss, Nadine Kruikemeier, Sanne van der Meulen, Heleen van Noort, Guda, "Digital diplomacy in GCC countries: Strategic communication of Western embassies on Twitter", Elsevier, pages 369--379, 20152015

Health and Food Safety

ArticleYear
Paez, Diego Gachet Aparicio, Fernando Buenaga, Manuel Ascanio, Juan R. Hervas, Ramon Lee, Sungyoung Nugent, Chris Bravo, Jose, "Big Data and IoT for Chronic Patients Monitoring", Ubiquitous Computing and Ambient Intelligence. Personalisation and User Adapted Services, Springer, pages 416-423, 20142014
Zhang, Y. Qiu, M. Tsai, C. W. Hassan, M. M. Alamri, A., "Health-CPS: Healthcare Cyber-Physical System Assisted by Cloud and Big Data", pages 1-8, 20152015
Groves, Peter Kayyali, Basel Knott, David Kuiken, Steve Van, "The ?big data? revolution in healthcare: Accelerating value and innovation", 20132013
Perttula, Arttu Koivisto, Antti Makela, Riikka Suominen, Marko Multisilta, Jari, "Social Navigation with the Collective Mobile Mood Monitoring System", Proceedings of the 15th International Academic MindTrek Conference: Envisioning Future Media Environments, ACM, pages 117--124, 20112011
Beckman, Richard Bisset, Keith R Chen, Jiangzhuo Lewis, Bryan Marathe, Madhav Stretz, Paula, "Isis: A networked-epidemiology based pervasive web app for infectious disease pandemic planning and response", Proceedings of the 20th ACM SIGKDD international conference on Knowledge discovery and data mining, pages 1847--1856, 20142014
Park, Yubin Ghosh, Joydeep, "Ludia: An aggregate-constrained low-rank reconstruction algorithm to leverage publicly released health data", Proceedings of the 20th ACM SIGKDD international conference on Knowledge discovery and data mining, pages 55--64, 20142014
Tran, Truyen Phung, Dinh Luo, Wei Harvey, Richard Berk, Michael Venkatesh, Svetha, "An integrated framework for suicide risk prediction", Proceedings of the 19th ACM SIGKDD international conference on Knowledge discovery and data mining, pages 1410--1418, 20132013
Somanchi, Sriram Adhikari, Samrachana Lin, Allen Eneva, Elena Ghani, Rayid, "Early Prediction of Cardiac Arrest (Code Blue) using Electronic Medical Records", Proceedings of the 21th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pages 2119--2126, 20152015
Feldman, Ronen Netzer, Oded Peretz, Aviv Rosenfeld, Binyamin, "Utilizing Text Mining on Online Medical Forums to Predict Label Change due to Adverse Drug Reactions", Proceedings of the 21th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pages 1779--1788, 20152015
Kate, Kiran Chaudhari, Sneha Prapanca, Andy Kalagnanam, Jayant, "FoodSIS: a text mining system to improve the state of food safety in singapore", Proceedings of the 20th ACM SIGKDD international conference on Knowledge discovery and data mining, pages 1709--1718, 20142014
Harpaz, Rave DuMouchel, William LePendu, Paea Shah, Nigam H, "Empirical Bayes model to combine signals of adverse drug reactions", Proceedings of the 19th ACM SIGKDD international conference on Knowledge discovery and data mining, pages 1339--1347, 20132013
Potash, Eric Brew, Joe Loewi, Alexander Majumdar, Subhabrata Reece, Andrew Walsh, Joe Rozier, Eric Jorgenson, Emile Mansour, Raed Ghani, Rayid, "Predictive modeling for public health: Preventing childhood lead poisoning", Proceedings of the 21th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pages 2039--2047, 20152015
Shah, Nigam H, "Medicine in the age of electronic health records", Proceedings of the 20th ACM SIGKDD international conference on Knowledge discovery and data mining, pages 1518--1518, 20142014
Jee, Kyoungyoung Kim, Gang-Hoon, "Potentiality of big data in the medical sector: focus on how to reshape the healthcare system", pages 79--85, 20132013
Chunara, Rumi Andrews, Jason R Brownstein, John S, "Social and news media enable estimation of epidemiological patterns early in the 2010 Haitian cholera outbreak", ASTMH, pages 39--45, 20122012
Yuan, Qingyu Nsoesie, Elaine O Lv, Benfu Peng, Geng Chunara, Rumi Brownstein, John S, "Monitoring influenza epidemics in china with search query from baidu", Public Library of Science, pages e64323, 20132013

Housing and Urban Development

ArticleYear
Kermany, Einat Mazzawi, Hanna Baras, Dorit Naveh, Yehuda Michaelis, Hagai, "Analysis of Advanced Meter Infrastructure Data of Water Consumption in Apartment Buildings", Proceedings of the 19th ACM SIGKDD international conference on Knowledge discovery and data mining, ACM Press, pages 1159-1167, 20132013
Emerson, Daniel Weligamage, Justin Nayak, Richi, "A data mining driven risk profiling method for road asset management", Proceedings of the 19th ACM SIGKDD international conference on Knowledge discovery and data mining, ACM Press, pages 1267-1275, 20132013
Green, Ben Caro, Alejandra Conway, Matthew Manduca, Robert Plagge, Tom Miller, Abby, "Mining Administrative Data to Spur Urban Revitalization", Proceedings of the 21th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pages 1829--1838, 20152015
Zoeter, Onno Dance, Christopher Clinchant, Stephane Andreoli, Jean-Marc, "New algorithms for parking demand management and a city-scale deployment", Proceedings of the 20th ACM SIGKDD international conference on Knowledge discovery and data mining, pages 1819--1828, 20142014
Zoeter, Onno Dance, Christopher Clinchant, Stephane Andreoli, Jean-Marc, "New algorithms for parking demand management and a city-scale deployment", Proceedings of the 20th ACM SIGKDD international conference on Knowledge discovery and data mining, pages 1819--1828, 20142014
Wu, Huayu Ng, Wee Siong Tan, Kian-Lee Wu, Wei Xiang, Shili Xue, Mingqiang, "A privacy preserving framework for managing vehicle data in road pricing systems", Proceedings of the 19th ACM SIGKDD international conference on Knowledge discovery and data mining, pages 1427--1435, 20132013

Mobility and Transportation

ArticleYear
Xue, Mingqiang Wu, Huayu Chen, Wei Ng, Wee Siong Goh, Gin Howe, "Identifying tourists from public transport commuters", Proceedings of the 20th ACM SIGKDD international conference on Knowledge discovery and data mining, pages 1779--1788, 20142014
Holleczek, Thomas Yin, Shanyang Jin, Yunye Antonatos, Spiros Goh, Han Leong Low, Samantha Shi-Nash, Amy others, "Traffic Measurement and Route Recommendation System for Mass Rapid Transit (MRT)", Proceedings of the 21th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pages 1859--1868, 20152015

Research and Innovation

ArticleYear
Spangler, Scott Wilkins, Angela D Bachman, Benjamin J Nagarajan, Meena Dayaram, Tajhal Haas, Peter Regenbogen, Sam Pickering, Curtis R Comer, Austin Myers, Jeffrey N others, "Automated hypothesis generation based on mining scientific literature", Proceedings of the 20th ACM SIGKDD international conference on Knowledge discovery and data mining, pages 1877--1886, 20142014
Duggan, Jennie Brodie, Michael L, "Hephaestus: Data Reuse for Accelerating Scientific Discovery.", CIDR, 20152015
Nagarajan, Meenakshi Wilkins, Angela D Bachman, Benjamin J Novikov, Ilya B Bao, Shenghua Haas, Peter Terron-Diaz, Maria E Bhatia, Sumit Adikesavan, Anbu K Labrie, Jacques J others, "Predicting Future Scientific Discoveries Based on a Networked Analysis of the Past Literature", Proceedings of the 21th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pages 2019--2028, 20152015
Kitano, Hiroaki, "Artificial Intelligence to Win the Nobel Prize and Beyond: Creating the Engine for Scientific Discovery.", 20162016

Treasury

ArticleYear
Dhurandhar, Amit Graves, Bruce Ravi, Rajesh Maniachari, Gopikrishanan Ettl, Markus, "Big Data System for Analyzing Risky Procurement Entities", Proceedings of the 21th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pages 1741--1750, 20152015
Junque de Fortuny, Enric Stankova, Marija Moeyersoms, Julie Minnaert, Bart Provost, Foster Martens, David, "Corporate residence fraud detection", Proceedings of the 20th ACM SIGKDD international conference on Knowledge discovery and data mining, pages 1650--1659, 20142014