Processing dataset to the json and txt files that are easy to process in the following steps. Training the TFIDF, LDA and fine-tuned sentence-BERT models. python main.py --phase lda --train_model ...