Mostafa Rahgouy

I am Mostafa Rahgouy. I was born in 23 December 1994 at Eilam. My interest in computer science begins with computer games. It was interesting to me how computer games can play intelligently with us and how it is possible for a game to contain easy, hard, and medium levels. Based on this curiosity, I chose Computer Science as a field of study at the University of Mohaghegh Ardabili.

In the undergraduate period, the most important course which gave me insight into the Computer Science world was Topics in Computer Science and Data Mining. In the Topics in Computer Science course, I studied Artificial Intelligent concepts( e.g. Greedy Search, Genetics, and Neural Networks algorithms) especially with Data Mining I got to know methods that are not conventional. I learned problems that are not algorithmic and they contain learning methods. This was so interesting to me because it was similar to human learning.

In this way, I get familiarize with Machine Learning world. All of these things, aspire me to go after PAN competitions on digital text forensics and get experinced on Machine Learning and Natural Language Processing.

Based on my curiosity, I am very interested and enthusiastic to learn and do research in areas including Machine Learning, Natural Language Processing, and Deep Learning.

Email: mostafarahgouy [at] student[dot] uma [dot] ac [dot] ir

Education


B.Sc in Computer Science

University of Mohaghegh Ardabili, Ardabil, Iran

2014 - 2019

I studied the Computer Science at the University of Mohaghegh Ardabili with overall GPA's 17.10 (out of 20).

University of Mohaghegh Ardabili website

Publication


Cross-domain Authorship Attribution: Author Identification using a Multi-Aspect Ensemble Approach - (CLEF 2019)

M Rahgouy, HB Giglou, T Rahgouy, MK Sheykhlan, E Mohammadzadeh

Abstract:

Author Attribution (AA) as one of the most important tasks of authorship analysis attracted huge body of research in recent years. In this task, given a document, the goal is to identify its author from a set of known authors and samples of their writings. In PAN 2019 shared tasks, the AA task is expanded in two ways. First, by having documents written by authors other than the known authors (UNK documents). Second, using a cross-domain set of documents. The task baseline and previous works mainly rely on character-level representation of documents because of their better generalization capability across different languages and domains. However, we hypothesize that ignoring coarse-grain features of documents may lead to loss of valuable information about the author's style. In this paper we propose an ensemble approach that combines models built upon different levels of document representation in order to investigate this hypothesis. Experimental results presented in this paper show that the coarse-grained representations of documents play an important role in identifying the authors style alongside the fine-grained representations.


Author Profiling: Bots and Gender Prediction using a Multi-Aspect Ensemble Approach - (CLEF 2019)

HB Giglou, M Rahgouy, T Rahgooy, MK Sheykhlan, E Mohammadzadeh

Abstract:

Author Profiling is one of the most important tasks in authorship analysis. In PAN 2019 shared tasks, the gender identification of the author is the main focus. Compared to the previous year the author profiling task is expended by having documents written by bots. In order to tackle this new challenge we propose a two phase approach. In the first phase we exploit the TF-IDF features of the documents to train a model that learns to detect documents generated by bots. Next, we train three models on character-level and word-level representations of the documents and aggregate their results using majority voting. Finally, we empirically show the effectiveness of our proposed approach on the PAN 2019 development dataset for author profiling.


Author Masking Directed by Author’s Style - (CLEF 2018)

M Rahgouy, HB Giglou, T Rahgooy, H Zeynali, SKM Rasooli

Abstract:

Author verification algorithms mainly rely on learning statistical fingerprints of authors. In the other hand, most of the previous algorithms in author masking try to apply changes to the original texts blindly without considering finger-prints and uses them to apply directed transformations and distortions to the original text.We represent author finger-prints with different statistics such as word choice distribution, sentence length preference, etc. obtained from author’s known texts. Automatic and manual evaluations of the obfuscated texts are very promising and show the effectiveness of our approach.


Skills

Programming

Python

  • Intermediate
  • Jupyter-notebook

C / C++

  • Advanced
  • QT GUI designer

Chess Project Github Repository
C#

  • Intermediate
  • Asp.Net MVC framework

Website Project Github Repository
HTML && CSS

  • Basic
  • HTML5 , CSS3

Website Project Github Repository
Matlab

  • Basic

Computer Tools and Packages

Machine Learning

  • Scikit-learn

Data Mining

  • Weka, Matlab

Course Project Github Repository
Natural Language Processing

  • NLTK, Gensim, VADER

DBMS

  • Microsoft SQL Server

IDE's

  • Microsoft Visual Studio, PyChram, IntelliJ IDEA, Jupyter Notebook

Version Control

  • Git

Oparating Systems

  • Linux(Ubuntu), Microsoft Windows

Others

  • Microsoft Word, Latax


Connect