TY - GEN
T1 - Efficient parallel CKY parsing on GPUs
AU - Yi, Youngmin
AU - Lai, Chao Yue
AU - Petrov, Slav
AU - Keutzer, Kurt
N1 - Publisher Copyright:
© 2011 Association for Computational Linguistics
PY - 2011
Y1 - 2011
N2 - Low-latency solutions for syntactic parsing are needed if parsing is to become an integral part of user-facing natural language applications. Unfortunately, most state-of-the-art constituency parsers employ large probabilistic context-free grammars for disambiguation, which renders them impractical for real-time use. Meanwhile, Graphics Processor Units (GPUs) have become widely available, offering the opportunity to alleviate this bottleneck by exploiting the fine-grained data parallelism found in the CKY algorithm. In this paper, we explore the design space of parallelizing the dynamic programming computations carried out by the CKY algorithm. We use the Compute Unified Device Architecture (CUDA) programming model to reimplement a state-of-the-art parser, and compare its performance on two recent GPUs with different architectural features. Our best results show a 26-fold speedup compared to a sequential C implementation.
AB - Low-latency solutions for syntactic parsing are needed if parsing is to become an integral part of user-facing natural language applications. Unfortunately, most state-of-the-art constituency parsers employ large probabilistic context-free grammars for disambiguation, which renders them impractical for real-time use. Meanwhile, Graphics Processor Units (GPUs) have become widely available, offering the opportunity to alleviate this bottleneck by exploiting the fine-grained data parallelism found in the CKY algorithm. In this paper, we explore the design space of parallelizing the dynamic programming computations carried out by the CKY algorithm. We use the Compute Unified Device Architecture (CUDA) programming model to reimplement a state-of-the-art parser, and compare its performance on two recent GPUs with different architectural features. Our best results show a 26-fold speedup compared to a sequential C implementation.
UR - http://www.scopus.com/inward/record.url?scp=85080793869&partnerID=8YFLogxK
M3 - Conference contribution
AN - SCOPUS:85080793869
T3 - IWPT 2011 - Proceedings of the 12th International Conference on Parsing Technologies
SP - 175
EP - 185
BT - IWPT 2011 - Proceedings of the 12th International Conference on Parsing Technologies
PB - Association for Computational Linguistics (ACL)
T2 - 12th International Conference on Parsing Technologies, IWPT 2011
Y2 - 5 October 2011 through 7 October 2011
ER -