Back to Search
Start Over
Applying machine learning and geolocation techniques to social media data (Twitter) to develop a resource for urban planning
- Source :
- PLoS ONE, Vol 16, Iss 2, p e0244317 (2021), PLoS ONE
- Publication Year :
- 2021
- Publisher :
- Public Library of Science (PLoS), 2021.
-
Abstract
- With all the recent attention focused on big data, it is easy to overlook that basic vital statistics remain difficult to obtain in most of the world. What makes this frustrating is that private companies hold potentially useful data, but it is not accessible by the people who can use it to track poverty, reduce disease, or build urban infrastructure. This project set out to test whether we can transform an openly available dataset (Twitter) into a resource for urban planning and development. We test our hypothesis by creating road traffic crash location data, which is scarce in most resource-poor environments but essential for addressing the number one cause of mortality for children over five and young adults. The research project scraped 874,588 traffic related tweets in Nairobi, Kenya, applied a machine learning model to capture the occurrence of a crash, and developed an improved geoparsing algorithm to identify its location. We geolocate 32,991 crash reports in Twitter for 2012–2020 and cluster them into 22,872 unique crashes during this period. For a subset of crashes reported on Twitter, a motorcycle delivery service was dispatched in real-time to verify the crash and its location; the results show 92% accuracy. To our knowledge this is the first geolocated dataset of crashes for the city and allowed us to produce the first crash map for Nairobi. Using a spatial clustering algorithm, we are able to locate portions of the road network (
- Subjects :
- Geographic information system
Computer science
Social Sciences
Transportation
Crash
computer.software_genre
Machine Learning
Urban Environments
Resource (project management)
Sociology
Medicine and Health Sciences
Public and Occupational Health
City Planning
Multidisciplinary
Applied Mathematics
Simulation and Modeling
Accidents, Traffic
Social Communication
Transportation Infrastructure
Terrestrial Environments
Social Networks
Motorcycles
Physical Sciences
Engineering and Technology
Medicine
Safety
Network Analysis
Algorithms
Research Article
Computer and Information Sciences
Science
Twitter
Research and Analysis Methods
Machine learning
Civil Engineering
Machine Learning Algorithms
Artificial Intelligence
Urban planning
Humans
Social media
business.industry
Ecology and Environmental Sciences
Traffic Safety
Kenya
Communications
Roads
Geolocation
Artificial intelligence
business
Geoparsing
Social Media
computer
Mathematics
Subjects
Details
- Language :
- English
- ISSN :
- 19326203
- Volume :
- 16
- Issue :
- 2
- Database :
- OpenAIRE
- Journal :
- PLoS ONE
- Accession number :
- edsair.doi.dedup.....3be9a814e0b5cd15e74eb1fa72a9cdfc