Sentiment Analysis using Logistic Regression

This project demonstrates how to build a sentiment analysis model that classifies customer reviews into positive and negative categories using machine learning.

📑 Summary

Firstly,the data is loaded in mdata and then the preprocessing applies. In preprocessig we use corpus array to hold all the filtered words from the reviews column.

This filtered words are obtained from re module to clear out all the special characters and symbols , then we convert it to lower and split it.

The most important part is Feature-Scaling using TF-IDF which enhance the ability of the model to distinguish b/w positive and negative sentiment . Then the LogisticRegression model is trained .

Also there is one point to notice that considerebly affects the outcome . The CountVectorizer() gives like 14 False Negative and 71 False Positive in contusion matrix , which clarifies that the model is gonna be affected due to the False Positive result . And since Tf-IDF out performs the Fasle Positive result by minimizing it to 6 so TF-IDF is used for vectorizing purpose.

📂 Project Structure

Sentiment_analysis_Logistic.ipynb
Jupyter notebook containing the full workflow: preprocessing, feature extraction, model training, and evaluation.
TestReviews.csv
Test dataset with sample reviews and sentiment labels.

⚙️ Workflow

Data Preparation
- Load training data from text files.
- It has assigned labels:
  - 1 → Positive
  - 0 → Negative
Preprocessing
- Convert text to lowercase
- Remove special characters and numbers
- Remove stopwords (keeping negations like not)
- Apply stemming
Feature Extraction
- Transform reviews into numerical vectors using TF-IDF or CountVectorizer.
Model Training
- Use Logistic Regression to classify reviews.
- Balanced class weights to handle uneven data distribution.
Evaluation
- Tested on TestReviews.csv.
- Metrics used: Accuracy, Confusion Matrix.

📊 Example Results

Accuracy: ~85–90% (depending on parameter tuning)
The model correctly identifies positive and negative reviews with good balance.

Name		Name	Last commit message	Last commit date
Latest commit History 11 Commits
README.md		README.md
Sentiment_analysis_Logistic.ipynb		Sentiment_analysis_Logistic.ipynb
TestReviews.csv		TestReviews.csv

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Sentiment Analysis using Logistic Regression

📑 Summary

📂 Project Structure

⚙️ Workflow

📊 Example Results

About

Uh oh!

Releases

Packages

Languages

ritesh-begin/Sentiment_analysis

Folders and files

Latest commit

History

Repository files navigation

Sentiment Analysis using Logistic Regression

📑 Summary

📂 Project Structure

⚙️ Workflow

📊 Example Results

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages