[Big Data Analytics, Text Analysis, Sentiment Analysis, Cloud Computing]
· Big Data Processing with Spark RDDs
· Large-Scale Data Processing with Spark on the Cloud
· Data Modeling and Optimizations
· Optimization Methods
· Large-Scale Supervised Learning
· Cloud Storage (AWS S3)
· Cloud Analytics (AWS EMR)
· Data Preprocessing (Tokenization, Stemming, Bag-of-Words)
· Exploratory Data Analysis (Sentiment Trend Analysis, Labeling Polarity for Classification)
· Classification Models with TF-IDF Vectors (Logistic Regression, Naive Bayes, Support Vector Machine)
· Classification Evaluation Metrics (Accuracy, Precision, TPR Recall, F-Score)