ANG: Accelerating NFA processing on GPUs via Exploring Multi-Level Fine-Grained Parallelism

Document Type

Conference Proceeding

Publication Date

12-16-2025

Department

Department of Computer Science

Abstract

Finite Automata (FA) processing is a core computation in various real-world applications. Over the past decades, extensive efforts have been dedicated to accelerating FA processing on modern parallel platforms, particularly GPUs, due to their high memory bandwidth and massive hardware parallelism. As Non-deterministic Finite Automata (NFA)-based applications have strong and growing demands for real-time data analytics nowadays, reducing latency in automata processing has become a critical priority. However, existing approaches face significant challenges when limited parallelism is exposed in NFA computations. In this work, we explore opportunities of introducing fine-grained parallelism from various sources and addressing the limitations of fast NFA processing. Specifically, by analyzing different NFA parallelization schemes, we identify the major performance issue caused by insufficient state-level parallelism in conventional designs. To overcome the bottleneck, this work introduces speculative parallelization tailored for GPU-based NFA processing, thus effectively exploiting fine-grained parallelism across multilevels, with a particular focus on input-chunk-level parallelism. To realize speculative parallelization in practice, we develop ANG, a latency-oriented NFA processing framework that overcomes key implementation challenges on GPUs. We evaluate the efficiency of ANG on a set of representative NFAs with diverse properties. Experimental results demonstrate that ANG achieves significant performance improvement compared to state-of-theart techniques, with reaching 11.74× speedup on average (and up to 49.88× in extreme cases).

Publication Title

2025 34th International Conference on Parallel Architectures and Compilation Techniques (PACT)

Recommended Citation

Wang, Y., Zhang, Y., Liu, Z., Qiu, J., & Wang, Z. (2025). ANG: Accelerating NFA processing on GPUs via Exploring Multi-Level Fine-Grained Parallelism. 2025 34th International Conference on Parallel Architectures and Compilation Techniques (PACT). http://doi.org/10.1109/PACT65351.2025.00023
Retrieved from: https://digitalcommons.mtu.edu/michigantech-p2/2693

Michigan Tech Publications

ANG: Accelerating NFA processing on GPUs via Exploring Multi-Level Fine-Grained Parallelism

Document Type

Publication Date

Department

Abstract

Publication Title

Recommended Citation

LINKS

Browse

Search

Graduate Students

Author Corner

Michigan Tech Publications

ANG: Accelerating NFA processing on GPUs via Exploring Multi-Level Fine-Grained Parallelism

Authors

Document Type

Publication Date

Department

Abstract

Publication Title

Recommended Citation

Share

LINKS

Browse

Search

Graduate Students

Author Corner