Comparative Analysis of Long Short-Term Memory Architecture for Text Classification

Moh Fajar Abdillah(1); Kusnawi Kusnawi(2*);

(1) Universitas Amikom Yogyakarta
(2) Universitas Amikom Yogyakarta
(*) Corresponding Author



Text classification, a subfield of NLP, groups text documents according to characteristics that indicate similarity between one document and another. One of the methods used for text classification is Long Short-Term Memory (LSTM). The performance of an LSTM model is influenced by several factors, such as the dataset, the architecture, and the tools used to classify the text. In this study, the researchers analyse how the number of layers in the LSTM architecture affects model performance. The research uses the IMDB movie reviews dataset, which contains 50,000 reviews. The data consist of positively and negatively labelled reviews, along with reviews that do not yet have a label. The IMDB movie reviews data pass through the following stages: data collection, data pre-processing, conversion to numerical format, text embedding using the pre-trained word embedding model FastText, training and testing of the LSTM classification model, and finally validation and testing of the model to obtain the results. The results show that the one-layer LSTM architecture achieves the best accuracy compared with the two-layer and three-layer LSTM architectures, with a training accuracy of 0.856 and a testing accuracy of 0.867. The two-layer LSTM reaches a training accuracy of 0.846 and a testing accuracy of 0.854, while the three-layer LSTM reaches a training accuracy of 0.848 and a testing accuracy of 0.864.
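
To make the architectural comparison more concrete, the following minimal sketch estimates how the number of trainable LSTM parameters grows as layers are stacked. It is an illustration under stated assumptions, not the authors' implementation: the embedding dimension of 300 (the size of the commonly distributed pre-trained FastText vectors) and the hidden size of 128 are assumed values that the abstract does not report, and the function names are hypothetical. The count uses the standard LSTM formulation with four gates and a single bias vector per gate; some libraries keep two bias vectors per gate, which gives a slightly larger total.

```python
def lstm_layer_params(input_dim: int, hidden_units: int) -> int:
    """Trainable parameters in one LSTM layer (four gates, one bias vector per gate)."""
    # Each gate has input weights, recurrent weights, and a bias vector.
    per_gate = input_dim * hidden_units + hidden_units * hidden_units + hidden_units
    return 4 * per_gate


def stacked_lstm_params(embedding_dim: int, hidden_units: int, num_layers: int) -> int:
    """Total LSTM parameters when num_layers layers are stacked on an embedding."""
    total = 0
    input_dim = embedding_dim  # the first layer reads the word embeddings
    for _ in range(num_layers):
        total += lstm_layer_params(input_dim, hidden_units)
        input_dim = hidden_units  # deeper layers read the previous layer's output
    return total


# Assumed values, not reported in the abstract.
EMBEDDING_DIM = 300
HIDDEN_UNITS = 128

for layers in (1, 2, 3):
    count = stacked_lstm_params(EMBEDDING_DIM, HIDDEN_UNITS, layers)
    print(f"{layers}-layer LSTM: {count} parameters")
```

Under these assumptions, each additional layer adds a fixed number of parameters on top of the first layer, so the deeper architectures are larger models trained on the same 50,000 reviews. This is one possible lens for reading the reported accuracies, not an explanation given by the authors.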


FastText; LSTM; NLP; Text Classification


Copyright (c) 2023 Moh Fajar Abdillah, Kusnawi Kusnawi

This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.