[Profile picture of Ruben Verborgh]

Ruben Verborgh

Predicting train occupancies based on query logs and external data sources

Gilles Vandewiele, Pieter Colpaert, Olivier Janssens, Joachim Van Herwegen, Ruben Verborgh, Erik Mannens, Femke Ongenae, and Filip De Turck

On dense railway networks—such as in Belgium—train travelers are frequently confronted with overly occupied trains, especially during peak hours. Crowdedness on trains leads to a deterioration in the quality of service and has a negative impact on the well-being of the passenger. In order to stimulate travelers to consider less crowded trains, the iRail project wants to show an occupancy indicator in their route planning applications by the means of predictive modelling. As there is no official occupancy data available, training data is gathered by crowd sourcing using the Web app iRail.be and the Railer application for iPhone. Users can indicate their departure & arrival station, at what time they took a train and classify the occupancy of that train into the classes: low, medium or high. While preliminary results on a limited data set conclude that the models do not yet perform sufficiently well, we are convinced that with further research and a larger amount of data, our predictive model will be able to achieve higher predictive performances. All datasets used in the current research are, for that purpose, made publicly available under an open license on the iRail website and in the form of a Kaggle competition. Moreover, an infrastructure is set up that automatically processes new logs submitted by users in order for our model to continuously learn. Occupancy predictions for future trains are made available through an API.

full text BibTeX other citation formats

Published in 2017 in Proceedings of the 7th International Workshop on Location and the Web.

Keywords:

Read this article online

Cite this article in your work

Cite this article easily using its BibTeX entry:

@inproceedings{vandewiele_locweb_2017,
  author = {Vandewiele, Gilles and Colpaert, Pieter and Janssens, Olivier and Van Herwegen, Joachim and Verborgh, Ruben and Mannens, Erik and Ongenae, Femke and De Turck, Filip},
  title = {Predicting train occupancies based on query logs and external data sources},
  year = 2017,
  month = apr,
  booktitle = {Proceedings of the 7th International Workshop on Location and the Web},
  url = {http://papers.www2017.com.au.s3-website-ap-southeast-2.amazonaws.com/companion/p1469.pdf},
  doi = {10.1145/3041021.3051699},
}

Alternatively, pick a reference of your choice below:

ACM
Gilles Vandewiele, Pieter Colpaert, Olivier Janssens, Joachim Van Herwegen, Ruben Verborgh, Erik Mannens, Femke Ongenae, and Filip De Turck. 2017. Predicting train occupancies based on query logs and external data sources. In Proceedings of the 7th International Workshop on Location and the Web.
APA
Vandewiele, G., Colpaert, P., Janssens, O., Van Herwegen, J., Verborgh, R., Mannens, E., Ongenae, F., & De Turck, F. (2017, April). Predicting train occupancies based on query logs and external data sources. Proceedings of the 7th International Workshop on Location and the Web.
IEEE
G. Vandewiele et al., “Predicting train occupancies based on query logs and external data sources,” in Proceedings of the 7th International Workshop on Location and the Web, 2017.
LNCS
Vandewiele, G., Colpaert, P., Janssens, O., Van Herwegen, J., Verborgh, R., Mannens, E., Ongenae, F., De Turck, F.: Predicting train occupancies based on query logs and external data sources. In: Proceedings of the 7th International Workshop on Location and the Web (2017).
MLA
Vandewiele, Gilles, et al. “Predicting Train Occupancies Based on Query Logs and External Data Sources.” Proceedings of the 7th International Workshop on Location and the Web, 2017.

Discuss this article