How Built Environments Shape Robotaxi Crashes in San Francisco – Wang, Shuai; Jiang, Emmanuel; Jun, Youngsang
Find Locations
Incident List
To add a new incident, click on the map to select the location of the accident place.
How to Use This Site
Introduction
Robotaxi adoption is no longer hypothetical. This web app provides visualization how built environments shape robotaxi crash risks in San Francisco, allowing planners and policymakers to implement a "Try-Before-You-Build” approach.
How to Use
Search box
The search box, which is on the top left, allows you to find crashes at specific location such as near your home. You might want to search for an address and zoom into the map. The incident list will automatically filter to show only crashes within the current view.
Incident List
On the Incident List, crashes are filtered by the visible map area when zoomed in. You can add, edit, or delete incident entries. To add a new incident, click on the map to select the location of the accident place. Clicking an entry will allow you to zoom the map to that location.
Layer List
On the Independent Variable Layers Lists, toggle switches allow you to view the spatial distribution of each independent variable. Additionally, there are three crash density prediction scenarios based on a Random Forest (RF) model. The radar chart on the top right shows how each block group scores on the Top-4 SHAP-contributing variables, which are SVI enclosures, Population, Building densitiy, and Commericial POI density. A larger radar area indicates a higher predicted crash risk for that area.
Download Data
"Download Data” on the top right corner allows you to filter crash data by robotaxi brand and vehicle year, then download it as a JSON format.
Notes for Random Forest Model
Defining crash density as a dependent variable and others as independent variables, the modeling was conducted by three steps:
(1) filter out collinear variables with high VIF to ensure model stability and keep interpretation clean;
(2) build a baseline OLS model, then a Random Forest that can handle thresholds;
(3) evaluate with R², MAE, MSE and SHAP values for both global and local explanations.
The RF model demonstrated comparable explanatory power, achieving an R-squared of approximately 0.701 on standardized data, with an MSE of 0.123 and MAE of 0.268. Given its ability to handle non-linearities, subsequent interpretation focuses primarily on the RF model. The global importance and impact direction of each feature on the Random Forest model's prediction of crash density. The following figures are non-linear relationships between crash density and built environment variables.


