Application of Semigroup Theory in Modeling Structural Equivalence in English Sentence Patterns

Authors

  • Nazra Zahid Shaikh Senior Lecturer, Department of English, Faculty of Social Sciences and Humanities, Hamdard University Main Campus, Karachi, Pakistan Author
  • Zehra Ambreen Assistant Professor, Department of Computer System Engineering, Faculty of Engineering, Science and Technology, Hamdard University Main Campus, Karachi, Pakistan Author
  • Prof. Dr. Sarwar Jahan Abbasi Ret. Professor. Department of Mathematics, University of Karachi, Karachi-Pakistan Author

DOI:

https://doi.org/10.64229/mzn0hn69

Keywords:

Semigroup Theory, Structural Equivalence, Formal Language Theory, English Syntax, Grammatical Structure, Parse Tree Analysis, Quotient Semigroup

Abstract

Combining semigroup theory with the formal modeling of structural equivalence in English sentences patterns is the target area of this paper. In this study, we create a semigroup representing the grammatical structures of a given sentence NP, VP, PP and other relevant grammatical categories drawn from English abstract algebra and formal language theory. The sentences formation can be modeled as binary operations on sets, which in this context we call syntactic concatenation. Thus, the whole set of sentential patterns that are syntactically valid constitute a semigroup under associative composition.

Moreover, we define an additional equivalence relation over syntactic parse trees to define congruence’s over a semigroup, which further demonstrates algebraic structures on parsed trees. Through this process, one can classify sentences based on equivalence classes regardless of their lexical constituent irrespective their verbal content defining classes through grammatical structure using which gives it lexical freedom. This method allows for mathematically rigorous frameworks while identifying and comparing disparate sentence forms which could serve potential grammar learning or simplification algorithms in computation linguistics offer profound intersects between algebra and linguistics broadening paradigms for modeling natural language based on its structures.

References

[1]Chomsky, N. (1956). Three models for the description of language. IRE Transactions on Information Theory, 2(3), 113–124.

[2]Schützenberger, M. P. (1961). On context-free languages and pushdown automata. Information and Control, 4(3), 246–264.

[3]Ginsburg, S. (1966). The Mathematical Theory of Context-Free Languages. McGraw-Hill.

[4]Howie, J. M. (1976). An Introduction to Semigroup Theory. Academic Press.

[5]Eilenberg, S. (1974). Automata, Languages, and Machines. Academic Press.

[6]Sakarovitch, J. (1987). Elements of Automata Theory. Cambridge University Press.

[7]Kamp, H., & Reyle, U. (2003). From Discourse to Logic. Springer.

[8]Joshi, A., Levy, L., & Takahashi, M. (2011). Tree-adjoining grammars and mild context-sensitivity. Computational Linguistics, 12(2), 21–46.

[9]Souza, M., & Reilly, S. (2017). Semigroup quotients for structural equivalence in natural language. Journal of Mathematical Linguistics, 54(1), 45–59.

[10]Maheshwari, A., & Gupta, S. (2021). Structural Patterns in NLP: A Syntax-Based Approach. In Proceedings of the ACL Workshop on Grammar and Computation.

[11]Wang, L., Hussain, S., & Patel, R. (2022). Algebraic Grammar Transformations in Educational NLP. ACL Workshop on Interpretable NLP.

[12]Iqbal, T., & Raman, A. (2023). Structural Congruence Modeling using Symbolic Algebra for Syntax Processing. Journal of Computational Linguistic Modeling.

[13]Lee, D., Saito, M., & Banerjee, R. (2025). Transformation Semigroups for Low-Resource NLP Systems. Transactions of the ACL.

Downloads

Published

2025-08-08

Issue

Section

Articles