Credit Card Churning Customer Analysis And Prediction Using Machine Learning And Deep Learning With Python


Download Credit Card Churning Customer Analysis And Prediction Using Machine Learning And Deep Learning With Python PDF/ePub or read online books in Mobi eBooks. Click Download or Read Online button to get Credit Card Churning Customer Analysis And Prediction Using Machine Learning And Deep Learning With Python book now. This website allows unlimited access to, at the time of writing, more than 1.5 million titles, including hundreds of thousands of titles in various foreign languages.

Download

Credit Card Churning Customer Analysis and Prediction Using Machine Learning and Deep Learning with Python


Credit Card Churning Customer Analysis and Prediction Using Machine Learning and Deep Learning with Python

Author: Vivian Siahaan

language: en

Publisher: BALIGE PUBLISHING

Release Date: 2023-07-18


DOWNLOAD





The project "Credit Card Churning Customer Analysis and Prediction Using Machine Learning and Deep Learning with Python" involved a comprehensive analysis and prediction task focused on understanding customer attrition in a credit card churning scenario. The objective was to explore a dataset, visualize the distribution of features, and predict the attrition flag using both machine learning and artificial neural network (ANN) techniques. The project began by loading the dataset containing information about credit card customers, including various features such as customer demographics, transaction details, and account attributes. The dataset was then explored to gain a better understanding of its structure and contents. This included checking the number of records, identifying the available features, and inspecting the data types. To gain insights into the data, exploratory data analysis (EDA) techniques were employed. This involved examining the distribution of different features, identifying any missing values, and understanding the relationships between variables. Visualizations were created to represent the distribution of features. These visualizations helped identify any patterns, outliers, or potential correlations in the data. The target variable for prediction was the attrition flag, which indicated whether a customer had churned or not. The dataset was split into input features (X) and the target variable (y) accordingly. Machine learning algorithms were then applied to predict the attrition flag. Various classifiers such as Logistic Regression, Decision Trees, Random Forests, Support Vector Machines (SVM), K-Nearest Neighbors (NN), Gradient Boosting, Extreme Gradient Boosting, Light Gradient Boosting, were utilized. These models were trained using the training dataset and evaluated using appropriate performance metrics. Model evaluation involved measuring the accuracy, precision, recall, and F1-score of each classifier. These metrics provided insights into how well the models performed in predicting customer attrition. Additionally, a confusion matrix was created to analyze the true positive, true negative, false positive, and false negative predictions. This matrix allowed for a deeper understanding of the classifier's performance and potential areas for improvement. Next, a deep learning approach using an artificial neural network (ANN) was employed for attrition flag prediction. The dataset was preprocessed, including features normalization, one-hot encoding of categorical variables, and splitting into training and testing sets. The ANN model architecture was defined, consisting of an input layer, one or more hidden layers, and an output layer. The number of nodes and activation functions for each layer were determined based on experimentation and best practices. The ANN model was compiled by specifying the loss function, optimizer, and evaluation metrics. Common choices for binary classification problems include binary cross-entropy loss and the Adam optimizer. The model was then trained using the training dataset. The training process involved feeding the input features and target variable through the network, updating the weights and biases using backpropagation, and repeating this process for multiple epochs. During training, the model's performance on both the training and validation sets was monitored. This allowed for the detection of overfitting or underfitting and the adjustment of hyperparameters, such as the learning rate or the number of hidden layers, if necessary. The accuracy and loss values were plotted over the epochs to visualize the training and validation performance of the ANN. These plots provided insights into the model's convergence and potential areas for improvement. After training, the model was used to make predictions on the test dataset. A threshold of 0.5 was applied to the predicted probabilities to classify the predictions as either churned or not churned customers. The accuracy score was calculated by comparing the predicted labels with the true labels from the test dataset. Additionally, a classification report was generated, including metrics such as precision, recall, and F1-score for both churned and not churned customers. To further evaluate the model's performance, a confusion matrix was created. This matrix visualized the true positive, true negative, false positive, and false negative predictions, allowing for a more detailed analysis of the model's predictive capabilities. Finally, a custom function was utilized to create a plot comparing the predicted values to the true values for the attrition flag. This plot visualized the accuracy of the model and provided a clear understanding of how well the predictions aligned with the actual values. Through this comprehensive analysis and prediction process, valuable insights were gained regarding customer attrition in credit card churning scenarios. The machine learning and ANN models provided predictions and performance metrics that can be used for decision-making and developing strategies to mitigate attrition. Overall, this project demonstrated the power of machine learning and deep learning techniques in understanding and predicting customer behavior. By leveraging the available data, it was possible to uncover patterns, make accurate predictions, and guide business decisions aimed at retaining customers and reducing attrition in credit card churning scenarios.

ANALYSIS AND PREDICTION PROJECTS USING MACHINE LEARNING AND DEEP LEARNING WITH PYTHON


ANALYSIS AND PREDICTION PROJECTS USING MACHINE LEARNING AND DEEP LEARNING WITH PYTHON

Author: Vivian Siahaan

language: en

Publisher: BALIGE PUBLISHING

Release Date: 2022-02-17


DOWNLOAD





PROJECT 1: DEFAULT LOAN PREDICTION BASED ON CUSTOMER BEHAVIOR Using Machine Learning and Deep Learning with Python In finance, default is failure to meet the legal obligations (or conditions) of a loan, for example when a home buyer fails to make a mortgage payment, or when a corporation or government fails to pay a bond which has reached maturity. A national or sovereign default is the failure or refusal of a government to repay its national debt. The dataset used in this project belongs to a Hackathon organized by "Univ.AI". All values were provided at the time of the loan application. Following are the features in the dataset: Income, Age, Experience, Married/Single, House_Ownership, Car_Ownership, Profession, CITY, STATE, CURRENT_JOB_YRS, CURRENT_HOUSE_YRS, and Risk_Flag. The Risk_Flag indicates whether there has been a default in the past or not. The machine learning models used in this project are K-Nearest Neighbor, Random Forest, Naive Bayes, Logistic Regression, Decision Tree, Support Vector Machine, Adaboost, LGBM classifier, Gradient Boosting, XGB classifier, MLP classifier, and CNN 1D. Finally, you will plot boundary decision, ROC, distribution of features, feature importance, cross validation score, and predicted values versus true values, confusion matrix, learning curve, performance of the model, scalability of the model, training loss, and training accuracy. PROJECT 2: AIRLINE PASSENGER SATISFACTION Analysis and Prediction Using Machine Learning and Deep Learning with Python The dataset used in this project contains an airline passenger satisfaction survey. In this case, you will determine what factors are highly correlated to a satisfied (or dissatisfied) passenger and predict passenger satisfaction. Below are the features in the dataset: Gender: Gender of the passengers (Female, Male); Customer Type: The customer type (Loyal customer, disloyal customer); Age: The actual age of the passengers; Type of Travel: Purpose of the flight of the passengers (Personal Travel, Business Travel); Class: Travel class in the plane of the passengers (Business, Eco, Eco Plus); Flight distance: The flight distance of this journey; Inflight wifi service: Satisfaction level of the inflight wifi service (0:Not Applicable;1-5); Departure/Arrival time convenient: Satisfaction level of Departure/Arrival time convenient; Ease of Online booking: Satisfaction level of online booking; Gate location: Satisfaction level of Gate location; Food and drink: Satisfaction level of Food and drink; Online boarding: Satisfaction level of online boarding; Seat comfort: Satisfaction level of Seat comfort; Inflight entertainment: Satisfaction level of inflight entertainment; On-board service: Satisfaction level of On-board service; Leg room service: Satisfaction level of Leg room service; Baggage handling: Satisfaction level of baggage handling; Check-in service: Satisfaction level of Check-in service; Inflight service: Satisfaction level of inflight service; Cleanliness: Satisfaction level of Cleanliness; Departure Delay in Minutes: Minutes delayed when departure; Arrival Delay in Minutes: Minutes delayed when Arrival; and Satisfaction: Airline satisfaction level (Satisfaction, neutral or dissatisfaction) The machine learning models used in this project are K-Nearest Neighbor, Random Forest, Naive Bayes, Logistic Regression, Decision Tree, Support Vector Machine, LGBM classifier, Gradient Boosting, XGB classifier, MLP classifier, and CNN 1D. Finally, you will plot boundary decision, ROC, distribution of features, feature importance, cross validation score, and predicted values versus true values, confusion matrix, learning curve, performance of the model, scalability of the model, training loss, and training accuracy. PROJECT 3: CREDIT CARD CHURNING CUSTOMER ANALYSIS AND PREDICTION USING MACHINE LEARNING AND DEEP LEARNING WITH PYTHON The dataset used in this project consists of more than 10,000 customers mentioning their age, salary, marital_status, credit card limit, credit card category, etc. There are 20 features in the dataset. In the dataset, there are only 16.07% of customers who have churned. Thus, it's a bit difficult to train our model to predict churning customers. Following are the features in the dataset: 'Attrition_Flag', 'Customer_Age', 'Gender', 'Dependent_count', 'Education_Level', 'Marital_Status', 'Income_Category', 'Card_Category', 'Months_on_book', 'Total_Relationship_Count', 'Months_Inactive_12_mon', 'Contacts_Count_12_mon', 'Credit_Limit', 'Total_Revolving_Bal', 'Avg_Open_To_Buy', 'Total_Amt_Chng_Q4_Q1', 'Total_Trans_Amt', 'Total_Trans_Ct', 'Total_Ct_Chng_Q4_Q1', and 'Avg_Utilization_Ratio',. The target variable is 'Attrition_Flag'. The machine learning models used in this project are K-Nearest Neighbor, Random Forest, Naive Bayes, Logistic Regression, Decision Tree, Support Vector Machine, LGBM classifier, Gradient Boosting, XGB classifier, MLP classifier, and CNN 1D. Finally, you will plot boundary decision, ROC, distribution of features, feature importance, cross validation score, and predicted values versus true values, confusion matrix, learning curve, performance of the model, scalability of the model, training loss, and training accuracy. PROJECT 4: MARKETING ANALYSIS AND PREDICTION USING MACHINE LEARNING AND DEEP LEARNING WITH PYTHON This data set was provided to students for their final project in order to test their statistical analysis skills as part of a MSc. in Business Analytics. It can be utilized for EDA, Statistical Analysis, and Visualizations. Following are the features in the dataset: ID = Customer's unique identifier; Year_Birth = Customer's birth year; Education = Customer's education level; Marital_Status = Customer's marital status; Income = Customer's yearly household income; Kidhome = Number of children in customer's household; Teenhome = Number of teenagers in customer's household; Dt_Customer = Date of customer's enrollment with the company; Recency = Number of days since customer's last purchase; MntWines = Amount spent on wine in the last 2 years; MntFruits = Amount spent on fruits in the last 2 years; MntMeatProducts = Amount spent on meat in the last 2 years; MntFishProducts = Amount spent on fish in the last 2 years; MntSweetProducts = Amount spent on sweets in the last 2 years; MntGoldProds = Amount spent on gold in the last 2 years; NumDealsPurchases = Number of purchases made with a discount; NumWebPurchases = Number of purchases made through the company's web site; NumCatalogPurchases = Number of purchases made using a catalogue; NumStorePurchases = Number of purchases made directly in stores; NumWebVisitsMonth = Number of visits to company's web site in the last month; AcceptedCmp3 = 1 if customer accepted the offer in the 3rd campaign, 0 otherwise; AcceptedCmp4 = 1 if customer accepted the offer in the 4th campaign, 0 otherwise; AcceptedCmp5 = 1 if customer accepted the offer in the 5th campaign, 0 otherwise; AcceptedCmp1 = 1 if customer accepted the offer in the 1st campaign, 0 otherwise; AcceptedCmp2 = 1 if customer accepted the offer in the 2nd campaign, 0 otherwise; Response = 1 if customer accepted the offer in the last campaign, 0 otherwise; Complain = 1 if customer complained in the last 2 years, 0 otherwise; and Country = Customer's location. The machine and deep learning models used in this project are K-Nearest Neighbor, Random Forest, Naive Bayes, Logistic Regression, Decision Tree, Support Vector Machine, LGBM classifier, Gradient Boosting, XGB classifier, MLP classifier, and CNN 1D. Finally, you will plot boundary decision, ROC, distribution of features, feature importance, cross validation score, and predicted values versus true values, confusion matrix, learning curve, performance of the model, scalability of the model, training loss, and training accuracy. PROJECT 5: METEOROLOGICAL DATA ANALYSIS AND PREDICTION USING MACHINE LEARNING WITH PYTHON Meteorological phenomena are described and quantified by the variables of Earth's atmosphere: temperature, air pressure, water vapour, mass flow, and the variations and interactions of these variables, and how they change over time. Different spatial scales are used to describe and predict weather on local, regional, and global levels. The dataset used in this project consists of meteorological data with 96453 total number of data points and with 11 attributes/columns. Following are the columns in the dataset: Formatted Date; Summary; Precip Type; Temperature (C); Apparent Temperature (C); Humidity; Wind Speed (km/h); Wind Bearing (degrees); Visibility (km); Pressure (millibars); and Daily Summary. The machine learning models used in this project are K-Nearest Neighbor, Random Forest, Naive Bayes, Logistic Regression, Decision Tree, Support Vector Machine, LGBM classifier, Gradient Boosting, XGB classifier, and MLP classifier. Finally, you will plot boundary decision, distribution of features, feature importance, cross validation score, and predicted values versus true values, confusion matrix, learning curve, performance of the model, scalability of the model, training loss, and training accuracy.

Big Data Analytics


Big Data Analytics

Author: Arun K. Somani

language: en

Publisher: CRC Press

Release Date: 2017-10-30


DOWNLOAD





The proposed book will discuss various aspects of big data Analytics. It will deliberate upon the tools, technology, applications, use cases and research directions in the field. Chapters would be contributed by researchers, scientist and practitioners from various reputed universities and organizations for the benefit of readers.