According to statistics, more than 90% of traffic accidents are related to the drivers[1]. Driving behavior analysis is of great significance in improving the issue of road traffic safety, carrying out active safety management of vehicles and implementing energy conservation and emission reduction. Previous driving behavior evaluation methods mainly focused on operating vehicles (including operating buses and trucks). The research methods are relatively advanced, including the analysis of driver monitoring data, controller area network (CAN) data and traffic flow information for driving behavior analysis and safety control. Driving behavior research for non-operating vehicles mainly relies on the naturalistic driving test, but naturalistic driving test has shortcomings such as limited sample coverage, high requirements for equipment and so it is difficult to promote this method widely. Vehicle satellite positioning research has the advantages of being low cost, convenient operation and is not time-consuming. With the popularity of smart phones, the massive positioning data generated by navigation software can be of great use in driving behavior research, and the results can be more representative and accurate, therefore, making the study of citywide driving behavior study possible. This paper analyzes the desensitized GPS navigation data provided by a map software programme for driving behavior studies.
In the field of operating vehicles’ driving behavior studies, little research has been done. Fatigue driving and the following experiment of 40 professional drivers proved that the degree of fatigue has a significant impact on headway distance[2]. Thiffault et al.[3] proved that drivers’ fatigue degrees vary significantly in a monotonous environment compared with a complex environment. Guo et al.[4] used an analytic hierarchy process to obtain the variation law of CAN data corresponding to drivers’ fatigue driving process. Analysis of operating vehicle traffic accidents proved that lack of safety awareness, risk perception and driving skill defects are the main causes of traffic accidents for operating vehicle drivers[5]. Based on CAN and video surveillance data, a relatively complete research method system is established.
In the field of non-operating vehicles’ driving behavior studies, Zhang et al.[6] used naturalistic driving data to build a lane-changing decision-making model, and established lane utility function in off-ramp areas, respectively, in free flow, steady flow and congested flow. They also generalized lane-changing characteristic under traffic flow state evolution. By analyzing naturalistic driving data, Wang et al.[7] comfirmed that: 1) A vehicle forward collision avoidance warning system can lower car following interval time significantly; 2) Multi-lane changing proportion of high-grade roads is higher[8]; 3) Chinese domestic drivers tended to be more aggressive than foreign drivers while changing lanes[9]. Olsen et al.[10] used American naturalistic driving data while conducting the study of drivers’ lane changing behavior. Bone et al.[11] studied the traits of aggressive and distracted drivers. Paaver et al.[12] proved that with proper education and intervention, new drivers’ risk driving behavior can drop significantly. Ahmed et al.[13-14] established the highway risk evaluation model with parameters such as traffic volume, speed, occupancy, weather and road geometric shape. Li et al.[15] proved that the visual environment of a curved section in a mountain has a significant impact on drivers’ perception-response time. Non-operating vehicle test data has high precision, but the data acquisition cost is too high, and test results are not representative enough.
In GPS data research field, Grengs et al.[16] conducted an in-depth analysis for four-week GPS positioning data from 78 users, and put forward the research methods of individual travel characteristics based on GPS and GIS. Zhao et al.[17] carried out behavior research on elderly driving behavior using GPS data. Liao et al.[18] carried out driving trajectories modelling based on GPS data. Nowadays, GPS positioning data is mainly used for highway driving behavior recognition and dangerous driving behavior detection based on operating vehicle supervision platform data in China. However, the data sample of the current research is relatively small[18] or mainly focuses on operating vehicle drivers[19]. Relative research exploited the use of GPS navigation data, yet further use such as the study of general motor vehicle drivers and application in traffic safety can be done in-depth. This paper analyzes the driving behavior characteristics by high-precision mass vehicle GPS data to reveal drivers’ driving behavior features and common unsafe behavior, so that we can take more directive measures to improve traffic safety.
Raw data contains the vehicle-navigation data of a one-month period in Shenzhen, as describted in Tab.1.
Tab.1 Raw data sample
DateTimeIDStatusLongitude/(°)Latitude/(°)Speed/(km·h-1 Direction/ ° 2017-05-0920 54 441743525134263432268511114.39437722.74768740242Note Status 11 means navigating software is on 10 means navigating software is off Direction is the heading direction angle of vehicle towards north.
The total volume of data is about 500 GB. Original data contains lots of invalid data. Positioning offset is a common problem and some users’ data lasts a quite short period of time, which is not worthy of studying. To tackle these problems, the record data corresponding to 10 users with the highest number of record in units of each 5 min is chosen to ensure the validity of data, which means that for each hour period, 120 samples are chosen at most. Data from the morning peak (8:00—9:00), night (21:00—22:00) and deep night (0:00—1:00) are selected for data analysis.
1.2.1 Auto encoder
In this research,the input and output nodes of network are consistent with vectors of GPS transportation data:
(1)
where δ is the proportion of over-speeding time; va is the average speed of vehicle; vs is the standard deviation of vehicle speed; as is the standard deviation of vehicle acceleration; is the standard deviation of vehicle acceleration where a<0.
Each element of X is the result after standardization.
(2)
where δi is the standard deviation of X; is the average value of X.
The training set of the auto encoder is defined as
V={X1,X2,...,Xn}T n∈M
(3)
where Xi is a summary sample of vehicle data with a unique ID. The model is a five-layer self-coding network, and it is used for compressing information and extracting core characteristics for later analysis. The auto encoder has the advantage of reducing noise with minimum loss of raw data.
The activation function used in this model is tanh function:
(4)
1.2.2 Self-organizing Mapping
Self-organizing mapping (SOM) is an unsupervised training neural network that automatically clusters input patterns through self-training. SOM neural network classifies the input mode set by finding an optimal reference vector set, and each reference vector is a connection weight vector corresponding to an output unit. Compared with the traditional pattern clustering method, the cluster center formed by SOM can be mapped to surface or plane while keeping the topology unchanged.
Specific implementation steps are shown in Fig.1.
Fig.1 SOM computing process
The auto encoder is used to reduce the noise of raw data, and then SOM is used to classify driving behaviors as safe, relatively safe, relatively aggressive and aggressive, as shown in Fig.2. Take speed, acceleration, and lateral offset as input values, and by using SOM and cluster analysis, the proportion distribution of different dangerous driving behaviors is carried out and shown in Tab.2. The proportions are calculated based on raw data characteristics and clustered by SOM, so no thresholds are set manually, and η is fixed according to the classified effectiveness.Relative thresholds setting is shown in Tab.2.
Fig.2 Percentage of various drivers
Tab.2 Thresholds settings for various dangerous driving behaviors
Dangerous driving behaviorIndicatorsCalculate methodThresholdsOver speedingSpeedThresholds judgementBased on road speed limitRapid speed changingAbsolute value of ac-celeration time per-centageBased on time percentage of |a|>4 m/s2 and SOM is used to calculate rapid speed change proportion|a|>4 m/s2Rapid turningSpeed and angular speedThresholds judgementv<40 km/h angle change in 3 s period is greater than 90° v>40 km/h angular speed is greater than 5 °/sRapid lane changingDirection angle aver-age value direction standard deviationSOM classificationAverage value and standard deviation of angular speed are 0.239 72.69 0.061 49.5 0.188 31.82 0.092 12.65 Note a is the acceleration v is the vehicle speed.
The proportion of different dangerous driving behaviors is shown in Tab.3.
Tab.3 Proportion distribution of different dangerous driving behaviors %
Over speedingRapid speed changeRapid turningRapid lane changing18141223
Driving direction is used to calculate rapid lane changing and rapid turning proportion. The proportion of rapid turning and rapid lane changing is calculated separately at the intersection area and road area, and the proportion is given based on SOM classification results. 16.1% of Shenzhen drivers are aggressive or relatively aggressive drivers, and over speeding and rapid lane changing the are the main dangerous driving behaviors.
Three time periods were extracted to carry out the Shenzhen driving behavior characteristics study, and the chosen time periods are morning peak (8:00—9:00), night (21:00—22:00) and deep night (0:00—1:00). Relative proportion is shown in Tab.4. Traffic flow is the highest during morning peak, and night period represents normal driving behavior. During deep night, drivers are tired, so that fatal and serious injury accidents with injuries are likely to occur.
Tab.4 Driving behavior of different time periods
Time periodAverages peed/ km·h-1 Speed std/ km·h-1 Proportion of low speed vehicles/%Time proportion of low speed driving/%Dangerous driving timeproportion/%0 00—1 00 deep night 63.910.200.98.88 00—9 00 morning peak 55.013.41.96.08.121 00—22 00 night 57.711.902.07.0
In Tab.4, the average speed is the average value of all vehicles; speed std is the average value of all vehicles’ standard variation; the proportion of low speed vehicles is the proportion of vehicles travelling with a low speed(v<10 km/h) more than 40% of total travelling time; the time proportion of low speed travelling is the time proportion of low speed travelling (v<10 km/h); dangerous driving time proportion is the time percentage of |speed×acceleration|>100[20](km/h, m/s2).
Conclusions can be drawn from Tab.4. First, drivers drive the fastest in deep night, during which dangerous driving time proportion is the highest, and drivers tend to be more aggressive. Secondly, drivers travel at the lowest speed during the morning peak period when the low speed vehicle proportion is the highest. Drivers accelerate and decelerate frequently, and the speed variation is the highest. Thirdly, drivers’ behaviors at 21:00—22:00 are the safest.
There is a large difference between the infrastructure inside and outside the original Shenzhen Economic Special Zone, as well as the drivers’ driving behavior. This paper takes Shekou area (in the original economic special zone) and East Shajing area (outside the original economic special zone) as examples to compare driving behavior in Shenzhen. The roads in the two chosen areas are mainly low-grade urban roads, as high-grade roads often cut across many administrative districts. High-grade roads mainly function for accessibility, while low-grade roads mainly function for flexibility. Continuity of traffic flow on low-grade roads is weak, which can better reflect regional travel characteristics. The chosen areas are shown in Fig.3, and a driving behavior comparison is given in Tab.5.
Fig.3 Typical traffic area
Tab.5 Driving behavior comparison between areas inside and outside the special economic zone
Time periodAreaAveragespeed/ km·h-1 Speed std/ km·h-1 Proportion of lowspeed vehicles/%Time proportion of low speed driving/%Dangerous driving time proportion/%0 00—1 00 deep night East Shajing25.714.014.5 23.5 3.9Shekou33.814.713.2 21.5 7.68 00—9 00 morning peak East Shajing16.811.247.0 48.0 2.0Shekou19.411.841.7 42.6 4.021 00—22 00 night East Shajing21.312.129.4 33.1 3.0Shekou21.410.140.1 41.8 4.3
Comparing and analyzing the driving behavior inside and outside the original economic special zone, it can be summarized that the average driving speed of Shekou area is always higher than that of East Shajing. Low speed driving proportion in Shekou area during the morning peak and deep night is also lower than that in East Shajing, indicating that the infrastructure conditions in Shekou area are better than East Shajing’s. The Shekou area has a higher proportion of dangerous driving time, which is more risky for driving.
This section analyzes the driving behavior of Beihuan Avenue, Longgang Avenue, Binhe Avenue and Meiguan Road. Results are shown in Tab.6.
According to the analysis, the overspeed phenomenon of Beihuan Avenue and Meiguan Road during the off-peak hours is the most serious, and the dangerous driving time proportion is the highest. The random lane changing behavior of Binhe Avenue and Beihuan is more serious. The relationship between accidents with average speed and dangerous driving time are as shown in Fig.4, Fig.5 and Tab.7.
According to the analysis, dangerous driving proportion during the late night period shows a good correlation with relative accident density, indicating that the proportion of dangerous driving behavior during the late night period can be used as a good measure of road safety.
Tab.6 Driving behavior of different roads %
Road sectionTime periodOver speeding percentageDangerous driving time proportionRandom lane changing proportionBeihuan Avenue0 00—1 0017.010.017.58 00—9 009.03.314.021 00—22 0010.17.114.0Longgang Avenue0 00—1 0012.47.317.08 00—9 008.94.812.221 00—22 0010.86.915.0Binhe Avenue0 00—1 0014.58.521.08 00—9 007.93.414.021 00—22 008.04.514.4Meiguan Road0 00—1 0015.97.911.08 00—9 005.82.816.921 00—22 009.45.812.5
Fig.4 Relationship between average speed variation and
Fig.5 Relationship between dangerous driving time proportion and
Tab.7 Dangerous driving vehicles proportion-accident density correlation
Road sectionDangerous driving vehicles proportion/%0 00—1 008 00—9 0021 00—22 00Relative accident density/ times· year·km -1 Hongli Road5.22.54.225.8Hongling Road7.53.54.533.9Huanggang Road6.72.83.953.0Shennan Road5.62.82.833.3Xinzhou Road8.53.34.054.9
To analyze the difference between the driving behavior of normal roads and accident-prone roads, the Beihuan-Huanggang road section and Beihuan-Caitian road sections (see Fig.6) are taken as examples, and the relative indices are shown in Tabs.8 to 10.
According to analysis, at deep night, the travelling speed of accident-prone roads is significantly higher than that of normal roads, and sharp turns, rapid acceleration/deceleration behavior proportions are significantly higher than those of normal road sections.
Fig.6 Location distribution of normal and accident-prone roads
Tab.8 Speed comparison between normal and accident-prone roads km/h
Time periodWeekdaysWeekendsNormal sectionAccident-proneNormal sectionAccident-prone sections0 00—1 00576054628 00—9 003940291221 00—22 0019192545
Tab.9 Acceleration proportion distribution comparison between normal and accident-prone roads %
Acceleration/ m·s-2 WeekdaysWeekendsNormal sectionAccident-prone sectionsNormal sectionAccident-prone sections -3 -2 1.72.11.42.4 -2 -1.38 2.43.22.53.7 -1.38 1.38 90.789.491.288.3 1.38 2 3.23.32.93.4 2 3 2.02.02.02.2
Tab.10 Sharp turn proportion comparison between normal and accident-prone roads %
Time periodNormal sectionsAccident-prone sectionsSharp turnNormal drivingSharp turnNormal drivingWeekdays397694Weekends397496
1) Based on the large volume of positioning data, this paper analyzes the driving behavior of Shenzhen drivers, gives the spatial and temporal distribution characteristics of driving behavior, and analyzes the driving behaviors of important passages and the correlation between dangerous driving behaviors and accidents. The method used in this paper is proved to be effective in analyzing GPS trajectory data, and provides a new way to study driving behavior. Rapid lane changing is the most common traffic violation in Shenzhen.
2) 16.1% drivers’ behavior is relatively aggressive, and they are more accident-prone, so that more traffic management measures should be taken upon them.
3) During deep night, drivers tend to over-speed; and during the morning peak, due to traffic congestion, drivers tend to accelerate and decelerate frequently.
4) Comparison between the driving behavior on similar roads inside and outside the original economic special zone indicates that due to the difference of development levels, the infrastructure level inside is higher than that outside, which creates the difference in dangerous driving proportion.
5) Dangerous driving behavior during deep night is highly correlated to accident densities.
[1]Tang Z Z, Zhang T J, He Y. Road safety assessment[M]. Beijing: China Communications Press, 2008. (in Chinese)
[2]Wu B, Yang Z Z, Xie J, et al. Vehicle speed distribution characteristics on work zone section of freeway[J]. Journal of Transportation Systems Engineering and Information Technology, 2016, 16(2): 219-224, 231. DOI:10.16097/j.cnki.1009-6744.2016.02.033.(in Chinese)
[3]Thiffault P, Bergeron J. Fatigue and individual differences in monotonous simulated driving[J]. Personality and Individual Differences, 2003, 34(1): 159-176. DOI:10.1016/s0191-8869(02)00119-8.
[4]Guo S, Teng J, Guo X, et al. Research on fatigue driving of commercial vehicle drivers based on multi-source real data [C]//China Intelligent Transportation Annual Meeting. Guangzhou, China, 2014:85-96. (in Chinese)
[5]Tao J. The deep analysis of traffic accidents drivers’ causes in operating vehicles[J]. Shanxi Science and Technology of Communications,2018, 253(4):155-158. (in Chinese)
[6]Zhang L F, Chen C, Zhang J Y, et al. Modeling lane-changing behavior in freeway off-ramp areas using naturalistic driving data[J]. Journal of Tongji University (Natural Science), 2018, 46(3):318-325,333. (in Chinese)
[7]Wang X S, Zhu M X, Xing Y L. Impacts of collision warning system on car-following behavior based on naturalistic driving data [J]. Journal of Tongji University (Natural Science), 2016, 44(7):1045-1051. (in Chinese)
[8]Wang X S, Li Y. Characteristics analysis of lane changing behavior based on the naturalistic driving data[J]. Journal of Transport Information and Safety, 2016, 34(1): 17-22. DOI:10.3963/j.issn 1674-4861.2016.01.001. (in Chinese)
[9]Wang X S, Yang M M. Cut-in behavior analyses based on naturalistic driving data[J]. Journal of Tongji University (Natural Science), 2018, 46(8): 1057-1063. DOI:10.11908/ j.issn.0253-374x.2018.08.008.(in Chinese)
[10]Olsen E C B, Lee S E, Wierwille W W, et al. Analysis of distribution, frequency, and duration of naturalistic lane changes[J].Proceedings of the Human Factors and Ergonomics Society Annual Meeting, 2002, 46(22): 1789-1793. DOI:10.1177/154193120204602203.
[11]Bone S A, Mowen J C. Identifying the traits of aggressive and distracted drivers: A hierarchical trait model approach[J]. Journal of Consumer Behaviour: An International Research Review, 2006, 5(5): 454-464. DOI:10.1002/cb.193.
[12]Paaver M, Eensoo D, Kaasik K, et al. Preventing risky driving: A novel and efficient brief intervention focusing on acknowledgement of personal risk factors[J]. Accident Analysis & Prevention, 2013, 50: 430-437. DOI:10.1016/j.aap.2012.05.019.
[13]Ahmed M, Yu R, Abdel-Aty M. Safety applications of automatic vehicle identification and real-time weather data on freeways[C]//18th ITS World Congress. Orlando, FL, USA, 2011.
[14]Ahmed M, Abdel-Aty M. A data fusion framework for real-time risk assessment on freeways[J]. Transportation Research Part C: Emerging Technologies, 2013, 26: 203-213. DOI:10.1016/j.trc.2012.09.002.
[15]Li Y, Chen Y R. Driver vision based perception-response time prediction and assistance model on mountain highway curve[J]. International Journal of Environmental Research and Public Health, 2016, 14(1): 31. DOI:10.3390/ijerph14010031.
[16]Grengs J, Wang X G, Kostyniuk L. Using GPS data to understand driving behavior[J]. Journal of Urban Technology, 2008, 15(2): 33-53. DOI:10.1080/10630730802401942.
[17]Zhao Y N, Yamamoto T, Morikawa T. An analysis on older driver’s driving behavior by GPS tracking data: Road selection, left/right turn, and driving speed[J]. Journal of Traffic and Transportation Engineering (English Edition), 2018, 5(1): 56-65. DOI:10.1016/j.jtte.2017.05.013.
[18]Liao L, Wu J, Zou F, et al. Trajectory topic modelling to characterize driving behaviors with GPS-based trajectory data[J]. Journal of Internet Technology, 2018, 19(3): 815-824.
[19]Liu Y J, Zeng C, Wang S J, et al. An evaluation method of safety and energy-saving driving behavior based on satellite positioning data[J]. Journal of Highway and Transportation Research and Development, 2018, 35(1): 121-128, 158.(in Chinese)
[20]Zhu X Y, Hu X B, Chiu Y C. Design of driving behavior pattern measurements using smartphone global positioning system data[J]. International Journal of Transportation Science and Technology, 2013, 2(4): 269-288. DOI:10.1260/2046-0430.2.4.269.