OpenPAV Platform

Method

Data-driven PAV evaluation pipeline with accelerated testing and benchmark ranking outputs.

Evaluation Pipeline

Since PAV manufacturers often do not disclose internal control algorithms, OpenPAV proposes a data-driven pipeline for performance evaluation based on PAV-related data. This framework enables benchmarking without relying on proprietary AV controllers.

OpenPAV framework

The final outcome is a multi-dimensional ranking billboard of PAV performance, including safety, mobility, energy efficiency, and user comfort.

Download full pipeline code package:

Full Pipline Code

Shifted Power Law Behavior Model

When performing stochastic trajectory prediction, many neural network-based approaches can accurately predict mean behavior, but few methods explicitly model the full stochastic distribution. Typical formulations often assume a Gaussian distribution, which can lead to limited performance in risky or highly adversarial scenarios.

To reduce the modeling gap in the tail distribution of vehicle behaviors, we develop a simple shifted power law model that can represent risky driving behaviors using limited data. Results show that this model predicts both longitudinal and lateral acceleration accurately in the tail region, improving evaluation performance in rare but safety-critical conditions.

illustration of the shifted power low behavior model

Reference: Chen, W., Huang, H., Ma, K., Li, H., Liang, S., Zhou, H., Li, X. (2025). Unveiling uniform shifted power law in stochastic human and autonomous driving behavior. arXiv preprint arXiv:2511.00659.

Download behavior modeling code package:

Behavior Modeling Code

State-Based Evaluation

In our previous study, we proposed a Markov state-based evaluation framework to efficiently analyze a PAV's life-cycle performance. The PAV driving process is modeled as a Markov decision process, and the stochastic behavior models extracted from Step 2 are used to define transition probabilities between states. Under Markov chain theory, the stationary distribution represents long-term system behavior, so the metric distributions under that stationary distribution can represent the overall performance level of a given PAV model.

Because the number of driving states is extremely large, directly computing stationary distributions over all states is computationally infeasible. To address this, we apply state space aggregation: original states are partitioned into a smaller number of blocks based on performance metrics. We then compute stationary distributions over aggregated blocks instead of all individual states, which substantially reduces dimensionality and computational cost.

State distribution for state-based evaluation

Reference: Ma, C., Zhou, H., Li, X. (2026). Safety Validation of Automated Vehicles Using Markov Chain Aggregation. Working Paper.

Download evaluation code package:

Evaluation Code

Upload Methods and Code

For code submissions, please contact Technical Contributor Hang Zhou (hzhou364@wisc.edu).