Consolidation of reports and matching of documents pose several challenges especially when dealing with large amounts of textual data. Thus, organizations are in need of intelligent systems that are capable of automating these processes, ensuring faster, more accurate analysis and retrieval of relevant information. This study applies Latent Semantic Indexing (LSI) and Cosine Similarity to automate the matching of gender-related issues, activities, and programs submitted by university offices. An intelligent web-based system was developed using Python and Django to implement these algorithms for report consolidation. Performance evaluation using accuracy, precision, recall, and F1-score demonstrated that the model correctly classified 90% of entries. A threshold sweep experiment further revealed that a similarity value of 0.51 provides the optimal decision boundary for identifying semantically similar instances. The findings confirm that LSI remains effective for low-resource institutional text analysis, enabling more efficient and accurate report consolidation.
DOI: 10.11591/ijeecs.v42.i2.pp609-618
Publication Date: 2026-05-01