Kalye has received the following awards and nominations. Way to go!
Please view this graphic if you want an interactive material to understand what is Kalye.
https://my.visme.co/projects/w4jnq1ok-project-kalye
There are two sides to the story. Let us tell you a set of stories from the two perspectives to put things into context.
The Road User’s Story
Navigation apps are helpful in giving Helen the directions to places she’s never been to before. She’s living in a mega-city in the Philippines, the metropolis of Metro Manila. The city is incredibly busy and the roads are networked like cobwebs, with unprecedented intersections, rough and slippery roads, hard-to-manage inclines, often hard-to-navigate turns and exits, etc. The country is also often flooded by monsoon. The roads in the nearby cities such as Tagaytay and a little bit further away, Baguio City are also challenged by fogs causing poor visibility in the roads.
This interweaving difficulty in navigating unfamiliar roads and largely un-evaluated road conditions adds stress and Helen has often encountered accidents caused by these factors, and in several occasions, have been involved in one herself. In these situations, she’s always been wishing for the navigation apps to not just direct her but also to give her information on Road Safety Levels of the road sections she’s traversing.
Some apps do warn them of existing traffics on the road, or blockades, or accidents already reported by previous users. But, it does not warn her during normal days whether the road ahead will be dangerous as it is slippery, or there’s a big bend, or that there’s high pedestrian density, etc. If these insights were only accessible to her, not only will it ease her navigation, but it will definitely guide her driver behaviour such as reducing speed when necessary, or being more vigilant when in a high risk road section.
If only the navigation app is also able to give her insights not just of reported/current accidents ahead, but also guide her to be more careful by telling her the risk level of the road sections are.
The Road Safety Evaluator’s Story
Arthur is working as part of an urban development team, the city council, and works as a researcher in measuring safety levels of different road sections in the metropolis. He happens to be a colleague of Helen and have been exchanging stories with her on how complex the roads are in the city, and how it would be great if everyday road users can also know where are the risky roads, or the sections very prone to road hazards.
This prompted Arthur to initiate a localized research of the city’s road networks to create a holistic database of road safety levels of the city’s roads! He wanted to check if he can create a machine learning problem approach to modelling or evaluating the different hazard risk levels in Metro Manila’s roads using historical records of accidents from the past years. It would’ve been a straightforward classification approach where features are used to determine whether a road section is accident prone or not. However, even from the start of the research, he was met by several challenges:
Kalye had to think creatively. The problem can’t be blindly approached by just machine learning techniques. In fact, the methodology is devised by taking into account the above challenges of both stakeholders. Kalye’s methodology
Recall that the first layer of Kalye’s problem is how to generate a Synthetic Risk Score model without relying on historical data of road accidents (as there’s a lack or minimal availability of it). Note that if this is approached as a supervised classification problem, you can simply extract features such as road features where the accidents happened, user-related variables of the accident, etc. and run a model to determine how certain variables impact road hazards.
Since it can’t be approached this way, Kalye’s methodology involves creating a Synthetic Risk Score used to “grade/evaluate” the level of risk of road-related hazards of each road section in a network. Described as follows is the framework for calculating the risk score.
For the full methodology, please refer to https://github.com/HQuizzagan/kalye/blob/master/Kalye's%20Road%20Safety%20Scoring%20Framework.pdf
Take note that the Synthetic Risk Score is not the information thrown to the user. Instead, it's encoded into interpretable risk classifications. The raw scores won’t be easily interpreted by the user. For example, if Helen is driving along Road Section A and she sees the hazard risk level is 0.987, she wouldn’t easily know what means. As such, the raw scores are encoded as
Note that Kalye’s labelling adopted a tripartite system for ease of interpretation, making it user-friendly. For example, had we used the five-level categorization of very low risk, low risk, medium risk, high risk, and very high risk, it would be very hard for users to differentiate how to behave as driver in a High Risk vs. Very High Risk environment.
In using impact factors such as the vehicle-user factors, road-related factors, environment-factors, and time-varying factors, most models would simply take the product of each to generate the risk score. The rationale is because each impact factor simply scales the overall impact into the final risk.
However, Kalye opted for a weighted combination modelling approach and here are the advantages:
To maximize the user base of Kalye, it features an easy-to-access “Report an Accident” floating button. This feature would allow crowd-sourcing of data to centralize records of road accidents which can then later be used for further studies. Why is there no well-maintained database of road accidents? For context, we say well-maintained if it contains features regarding the incident that describes it well.
So how does Kalye’s Report an Accident help?
The reporting can now be done by anyone who uses the Kalye app. Road users who are then encountering road accidents can simply follow these very efficient steps:
How essential is this crowd-sourced and Kalye-managed database of road accidents?
As mentioned above, it is very hard to manually extract important information on road accidents for records purposes. With Kalye’s backend, the report is automatically generated by
The main role of satellite imagery is to measure the model features (i.e. variables under each of the four impact factors) without need of manual surveillance. This makes the methodology easy to adopt since it is an Automated Hazard Detection Framework. Presented below are very brief overviews of how satellite images were used by Kalye in assessing the road hazards!
Curve identification and information extraction can be made possible by employing curve extraction methodologies on satellite images and GIS shapefiles of road networks. We determined three features that will help contribute to the project goals: curve type, curve length, and curve degree (basically, “sharpness). Using either satellite images or GIS shapefiles present different advantages and disadvantages, which we discuss below.
Easa, S. M., Dong, H., & Li, J. (2007) presented a method for establishing road horizontal curves from satellite imagery. Canny edge detector method was implemented, which involved the conversion of the colored image to a gray image and creation of an edge image by locating abrupt changes in the intensity function. The Hough transform, a popular algorithm for detecting features from raster images, was used to detect the tangents straight lines and the corresponding horizontal curves. The authors were able to accurately establish simple and reverse curves for a complex freeway interchange (Fig. 1) as well as extract the parameters of the curves, including radii, start point, and end point. However, the paper only focused on establishing horizontal curves for only one side of the road: inside or outside edge. While it is possible to apply the proposed method twice, there is no guarantee that the two sides of the curve will share a similar center, as they should.
Fig. 1. Results of establishing (a) a simple horizontal curve and (b) a reverse horizontal curve at a freeway interchange.
An ArcGIS add-in tool, CurveFinder, developed by Li et. al. (2012 & 2015) uses GIS roadway maps to extract horizontal curve data. The fully automatic tool makes use of a curve data-extraction algorithm that: (a) detects all curves from each road in a selected roadway layer, regardless of the type of curve; (b) classifies each curve into one of two categories: simple or compound; (c) computes the radius and degree of curvature for each simple curve, as well as the curve length for simple curves and compound curves; and (d) creates curve features and layers for all identified curves in the GIS. They were able to fully demonstrate horizontal curve data extraction and curve type identification. However, low-quality GIS roadway segments are a major cause of error, for two scenarios (Fig. 2): (a) deviation from actual roadway alignment; and (b) the low-vertex resolution of the GIS roadway centerline to describe the actual alignment of the roadway.
Fig. 2. Typical scenarios of low-quality GIS source data: (a) Scenario 1 and (b) Scenario 2.
We propose a methodology to combine both of these approaches to fully maximize the data. We make use of satellite imagery data to create an edge map of the road network. The centerline of the edges are then computed, creating a roadway polyline. This polyline network is converted to a shapefile that can be fed into CurveFinder, and we subsequently extract the relevant information.
Base reference: Extracting Topographic Data from Online Sources to Generate a Digital Elevation Model for Highway Preliminary Geometric Design
Two ways to extract ASTER data from Google Earth:
Traditional
Standard
Data Extraction by Image Recognition
Data Storage and Management
Validation of Extraction
DEM Generation
Why use MODIS?
The MODIS instrument is aboard the Terra and Aqua spacecraft. The geographical coverage of the instrument allows views of the entire surface of the Earth and has a passby frequency of every one to two days (NASA). In the context of application, it can allow calculation of average visibility in major road networks every day.
How to measure ground fog levels using MODIS data?
Kalye adopted the algorithm implemented by Bendix, et al. in their research entitled “Ground Fog Detection from Space Based on MODIS Daytime Data—A Feasibility Study”. This allows an efficient and proven accuracy in terms of NOWCASTING ground visibility due to fog formation.
The strength of the methodology is that even if it is using daytime data, it can exclude mid- and high-level clouds from the detection. The effect is that it won’t just calculate the probability of poor ground visibility. Instead, it demonstrated promising nowcasting of true visibility at ground level. Kalye will extend this to grade road visibility during heavy outpour of rain.
The video demonstration can be viewed here
https://drive.google.com/drive/folders/14dOiNukjSIRy3zwHop9yHv24im5pCSzT?usp=sharing
If you want to download the play with the prototype app, please access the APK here:
Experience the Kalye app’s UI here: http://bit.ly/KalyePrototype