I am Mostafa Rahgouy. I was born in 23 December 1994 at Eilam. My interest in computer science begins with computer games. It was interesting to me how computer games can play intelligently with us and how it is possible for a game to contain easy, hard, and medium levels. Based on this curiosity, I chose Computer Science as a field of study at the University of Mohaghegh Ardabili.

In the undergraduate period, the most important course which gave me insight into the Computer Science world was Topics in Computer Science and Data Mining. In the Topics in Computer Science course, I studied Artificial Intelligent concepts( e.g. Greedy Search, Genetics, and Neural Networks algorithms) especially with Data Mining I got to know methods that are not conventional. I learned problems that are not algorithmic and they contain learning methods. This was so interesting to me because it was similar to human learning.

In this way, I get familiarize with Machine Learning world. All of these things, aspire me to go after PAN competitions on digital text forensics and get experinced on Machine Learning and Natural Language Processing.

Based on my curiosity, I am very interested and enthusiastic to learn and do research in areas including Machine Learning, Natural Language Processing, and Deep Learning.

Email: mostafarahgouy [at] student[dot] uma [dot] ac [dot] ir

Publication

Cross-domain Authorship Attribution: Author Identification using a Multi-Aspect Ensemble Approach - (CLEF 2019)

M Rahgouy, HB Giglou, T Rahgouy, MK Sheykhlan, E Mohammadzadeh

Abstract:

Author Attribution (AA) as one of the most important tasks of authorship analysis attracted huge body of research in recent years. In this task, given a document, the goal is to identify its author from a set of known authors and samples of their writings. In PAN 2019 shared tasks, the AA task is expanded in two ways. First, by having documents written by authors other than the known authors (UNK documents). Second, using a cross-domain set of documents. The task baseline and previous works mainly rely on character-level representation of documents because of their better generalization capability across different languages and domains. However, we hypothesize that ignoring coarse-grain features of documents may lead to loss of valuable information about the author's style. In this paper we propose an ensemble approach that combines models built upon different levels of document representation in order to investigate this hypothesis. Experimental results presented in this paper show that the coarse-grained representations of documents play an important role in identifying the authors style alongside the fine-grained representations.

Author Profiling: Bots and Gender Prediction using a Multi-Aspect Ensemble Approach - (CLEF 2019)

HB Giglou, M Rahgouy, T Rahgooy, MK Sheykhlan, E Mohammadzadeh

Abstract:

Author Profiling is one of the most important tasks in authorship analysis. In PAN 2019 shared tasks, the gender identification of the author is the main focus. Compared to the previous year the author profiling task is expended by having documents written by bots. In order to tackle this new challenge we propose a two phase approach. In the first phase we exploit the TF-IDF features of the documents to train a model that learns to detect documents generated by bots. Next, we train three models on character-level and word-level representations of the documents and aggregate their results using majority voting. Finally, we empirically show the effectiveness of our proposed approach on the PAN 2019 development dataset for author profiling.

Author Masking Directed by Author’s Style - (CLEF 2018)

M Rahgouy, HB Giglou, T Rahgooy, H Zeynali, SKM Rasooli

Abstract:

Author verification algorithms mainly rely on learning statistical fingerprints of authors. In the other hand, most of the previous algorithms in author masking try to apply changes to the original texts blindly without considering finger-prints and uses them to apply directed transformations and distortions to the original text.We represent author finger-prints with different statistics such as word choice distribution, sentence length preference, etc. obtained from author’s known texts. Automatic and manual evaluations of the obfuscated texts are very promising and show the effectiveness of our approach.

Mostafa Rahgouy

Welcome to my personal website!

Based on my curiosity, I am very interested and enthusiastic to learn and do research in areas including Machine Learning, Natural Language Processing, and Deep Learning.

Education

Publication

Skills

Python

C / C++

C#

Java

HTML && CSS

Matlab

Machine Learning

Data Mining

Natural Language Processing

DBMS

IDE's

Version Control

Oparating Systems

Others

Connect