[en] Because of the rapid development of technology, larger data sets on activity travel behavior have become available. These data sets often lack semantic interpretation. This lack of interpretation implies that annotation of activity type and transportation mode is necessary. This paper aims to infer activity types from Global Positioning System (GPS) traces by developing a decision tree-based model. The model considers only activity start times and activity durations. On the basis of the decision tree classification, a probability distribution and a point prediction model were constructed. The probability matrix described the probability of each activity type for each class (i.e., combination of activity start time and activity duration). In each class, the point prediction model selected the activity type that had the highest probability. Two types of data were collected in 2006 and 2007 in Flanders, Belgium (i.e., activity travel data and GPS data). The optimal classification tree constructed contained 18 leaves. Consequently, 18 if-then rules were derived. An accuracy of 74% was achieved when the tree was trained. The accuracy of the model for the validation set (72.5%) showed that overfitting was minimal. When the model was applied to the test set, the accuracy was almost 76%. The models indicated the importance of time information in the semantic enrichment process. This study contributes to future data collection in that it enables researchers to infer activity types directly from activity start time and duration information obtained from GPS data. Because no location information is needed, this research can be easily and readily applied to millions of individual agents.
Research Center/Unit :
Lepur : Centre de Recherche sur la Ville, le Territoire et le Milieu rural - ULiège LEMA - Local Environment Management and Analysis
Disciplines :
Special economic topics (health, labor, transportation...) Civil engineering
Author, co-author :
Reumers, Sofie; Universiteit Hasselt - UH
Liu, Feng; Universiteit Hasselt - UH
Janssens, Davy; Universiteit Hasselt - UH
Cools, Mario ; Université de Liège - ULiège > Département Argenco : Secteur TLU+C > Transports et mobilité
Wets, Geert; Universiteit Hasselt - UH
Language :
English
Title :
Semantic Annotation of Global Positioning System Traces: Activity Type Inference
Publication date :
2013
Journal title :
Transportation Research Recordv: Journal of the Transportation Research Board
ISSN :
0361-1981
eISSN :
2169-4052
Publisher :
National Research Council, National Academy of Sciences. Commission on Sociotechnical Systems. Transportation Research Board, Washington, United States - District of Columbia
The On-Line Travel Survey Manual: A Dynamic Document for Transportation Professionals. Provided by the Members and Friends of the Transportation Research Board's Travel Survey Methods Committee (ABJ40), Washington, D. C., 2009. http://www.travelsurveymanual.org/
Sen, A., S. St, L. Yang, and E. Christopher. Household Travel Survey Nonresponse Estimates: The Chicago Experience. In Transportation Research Record 1493, TRB, National Research Council, Washington, D. C., 1995, pp. 170-177.
Wolf, J., R. Guensler, and W. Bachman. Elimination of the Travel Diary: Experiment to Derive Trip Purpose from Global Positioning System Travel Data. In Transportation Research Record: Journal of the Transportation Research Board, No. 1768, TRB, National Research Council, Washington, D. C., 2001, pp. 125-134. (Pubitemid 34254752)
Wolf, J., M. Oliveira, and M. Thompson. Impact of Underreporting on Mileage and Travel Time Estimates: Results from Global Positioning System-Enhanced Household Travel Survey. In Transportation Research Record: Journal of the Transportation Research Board, No. 1854, Transportation Research Board of the National Academies, Washington, D. C., 2003, pp. 189-198.
Asakura, Y., and E. Hato. Tracking Individual Travel Behaviour Using Mobile Phones: Recent Technological Development. In The Expanding Sphere of Travel Behaviour Research: Selected Papers from the 11th International Conference on Travel Behaviour Research (R. Kitamura, T. Yoshii, and T. Yamamoto, eds.), Emerald, Bingley, England, 2008, pp. 207-223.
Stopher, P. R., P. J. Bullock, and F. N. F. Horst. Conducting a GPS Survey with a Time-Use Diary. Presented at 82nd Annual Meeting of the Transportation Research Board, Washington, D. C., 2003.
Pendyala, R. M. Collection and Analysis of GPS-Based Travel Data for Understanding and Modeling Activity-Travel Patterns in Time and Space. University of South Florida, Tampa, undated.
Kochan, B., T. Bellemans, D. Janssens, G. Wets, and H. J. P. Timmermans. Assessment of the Quality of Location Data Obtained by the GPS-Enabled PARROTS Survey Tool. Presented at Social Positioning Method 2008, Tartu, Estonia, 2008.
Cools, M., E. Moons, T. Bellemans, D. Janssens, and G. Wets. Surveying Activity-Travel Behavior in Flanders: Assessing the Impact of the Survey Design. Proc., BIVEC-GIBET Transport Research Day 2009, Part II (C. Macharis and L. Turcksin, eds.), VUBPress, Brussels, Belgium, pp. 727-741.
Wolf, J., S. Schnfelder, U. Samaga, M. Oliveira, and K. W. Axhausen. Eighty Weeks of Global Positioning System Traces: Approaches to Enriching Trip Information. In Transportation Research Record: Journal of the Transportation Research Board, No. 1870, Transportation Research Board of the National Academies, Washington, D. C., 2004, pp. 46-54. (Pubitemid 40025780)
Battelle Transportation Division. Lexington Area Travel Data Collection Test Final Report. FHWA, U. S. Department of Transportation, Washington, D. C., 1997.
Wolf, J. L., M. G. S. Oliveira, P. Troped, C. E. Matthews, E. K. Cromley, and S. J. Melly. Mode and Activity Identification Using GPS and Accelerometer Data. Presented at 85th Annual Meeting of the Transportation Research Board, Washington, D. C., 2006.
Pearson, D. Global Positioning System (GPS) and Travel Surveys: Results from the 1997 Austin Household Survey. Proc., 8th Conference on the Application of Transportation Planning Methods, Tex., 2001.
Wolf, J. Using GPS Data Loggers to Replace Travel Diaries in the Collection of Travel Data. MS thesis. Georgia Institute of Technology, Atlanta, 2000.
Deng, Z., and M. Ji. Deriving Rules for Trip Purpose Identification from GPS Travel Survey Data and Land Use Data: A Machine Learning Approach. Traffic and Transportation Studies 2010, ASCE, Reston, Va., pp. 768-777.
Stopher, P., E. Clifford, J. Zhang, and C. FitzGerald. Deducing Mode and Purpose from GPS Data. Presented at 11th TRB National Transportation Planning Applications Conference, Daytona Beach, Fla., 2007.
McGowen, P. Predicting Activity Types from GPS and GIS Data. PhD dissertation. University of California, Irvine, 2006.
Schuessler, N., and K. W. Axhausen. Identifying Trips and Activities and Their Characteristics from GPS Raw Data Only. Proc., 8th International Conference on Survey Methods in Transport, Annecy, France, 2009.
Schuessler, N., and K. W. Axhausen. Processing Raw Data from Global Positioning Systems Without Additional Information. In Transportation Research Record: Journal of the Transportation Research Board, No. 2105, Transportation Research Board of the National Academies, Washington, D. C., 2009, pp. 28-36.
Zheng, Y., Y. Chen, Q. Li, X. Xie, and W.-Y. Ma. Understanding Transportation Modes Based on GPS Data for Web Applications. ACM Transactions on the Web, Vol. 4, No. 1, 2010, pp. 1-36.
Drazin, S., and M. Montag. Decision Tree Analysis Using Weka Machine Learning Project II. University of Miami, Fla., undated, pp. 1-3.
Bellemans, T., B. Kochan, D. Janssens, G. Wets, and H. J. P. Timmermans. Field Evaluation of Personal Digital Assistant Enabled by Global Positioning System: Impact on Quality of Activity and Diary Data. In Transportation Research Record: Journal of the Transportation Research Board, No. 2049, Transportation Research Board of the National Academies, Washington, D. C., 2008, pp. 136-143.
Wets, G., K. Vanhoof, T. Arentze, and H. J. P. Timmermans. Identifying Decision Structures Underlying Activity Patterns: An Exploration of Data Mining Algorithms. In Transportation Research Record: Journal of the Transportation Research Board, No. 1718, TRB, National Research Council, Washington, D. C., 2000, pp. 1-9. (Pubitemid 32790859)
Witten, I. H., E. Frank, and M. A. Hall. Data Mining: Practical Machine Learning Tools and Techniques. Morgan Kaufmann Publishers, Elsevier Inc., Burlington, Mass., 2011.
Drummond, C., and R. C. Holte. C4.5, Class Imbalance, and Cost Sensitivity: Why Under-Sampling Beats Over-Sampling. Presented at Workshop on Learning from Imbalanced Datasets II, Washington, D. C., 2003.
Salzberg, S. L. Book Review: C4.5: Programs for Machine Learning by J. Ross Quinlan. Morgan Kaufmann Publishers, Inc., 1993.
Machine Learning, Vol. 16, No. 3, 1994, pp. 235-240.