The Data Science with Python course will furnish you with in-depth knowledge of the various libraries and packages required to perform data analysis, data visualization, web scraping, machine learning and natural language processing using Python. 

 

The Python for Data Science course is packed with real-life projects focused on customer segmentation, macro calls, attrition analysis, and retail analysis, as well as demos and case studies to give you practical experience in installing and working in the Python environment.

Python Data Science Course duration: 180 hours (at least 72 hours of live training, plus practice and self-study, with about 8 hours of weekly self-study).

 

Who Should Do This Course?

This course suits candidates from quantitative backgrounds such as Engineering, Finance, Maths, Statistics, or Business Management who are not looking for just any Python course, but want Python training with advanced analytics and machine learning skills to get a head start on a career in data science.

 


Course Preview

An Overview of Analytics & Data Science
•What is analytics & Data Science?
•Common Terms in Analytics
•Analytics vs. Data warehousing, OLAP, MIS Reporting
•Relevance in industry and need of the hour
•Types of problems and business objectives in various industries
•How leading companies are harnessing the power of analytics?
•Critical success drivers
•Overview of analytics tools & their popularity
•Analytics Methodology & problem solving framework
•List of steps in Analytics projects
•Identify the most appropriate solution design for the given problem statement
•Project plan for Analytics project & key milestones based on effort estimates
•Build Resource plan for analytics project
•Why Python for data science?

Python Essentials (Core)
•Overview of Python- Starting with Python
•Introduction to installation of Python
•Introduction to Python Editors & IDEs (Canopy, PyCharm, Jupyter, Rodeo, IPython, etc.)
•Understand Jupyter notebook & Customize Settings
•Concept of Packages/Libraries – Important packages (NumPy, SciPy, scikit-learn, Pandas, Matplotlib, etc.)
•Installing & loading Packages & Name Spaces
•Data Types & Data objects/structures (strings, Tuples, Lists, Dictionaries)
•List and Dictionary Comprehensions
•Variable & Value Labels –  Date & Time Values
•Basic Operations – Mathematical – string – date
•Control flow & conditional statements
•Debugging & Code profiling
•How to create class and modules and how to call them?
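As a small illustration of the list and dictionary comprehensions listed above, here is a minimal sketch. The `scores` data and all variable names are made up for this example and are not taken from the course material.

```python
# Hypothetical sample data: names mapped to exam scores.
scores = {"asha": 72, "ben": 48, "chen": 91}

# List comprehension: names of everyone who scored at least 50.
passed = [name for name, score in scores.items() if score >= 50]

# Dictionary comprehension: rescale each score to a 0-1 fraction.
fractions = {name: score / 100 for name, score in scores.items()}

print(passed)
print(fractions)
```

Both forms build a new collection in one expression, which is usually easier to read than an explicit loop with `append`.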

Accessing/Importing and Exporting Data using python modules
•Importing Data from various sources (CSV, TXT, Excel, Access, etc.)
•Database Input (Connecting to database)
•Viewing Data objects –  subsetting, methods
•Exporting Data to various  formats
•Important Python modules: Pandas, BeautifulSoup
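The import, subset and export steps above can be sketched with the standard library alone. This is only an illustration: the course names Pandas for this work, and the CSV content, column names and threshold below are assumptions invented for the example.

```python
import csv
import io
import json

# Hypothetical CSV content; in practice this would be read from a file.
raw = "customer,city,spend\nA101,Mumbai,250\nA102,Chennai,90\n"

# Import: parse each row into a dictionary keyed by the header line.
rows = list(csv.DictReader(io.StringIO(raw)))

# Convert the numeric column from text to int.
for row in rows:
    row["spend"] = int(row["spend"])

# View a subset: customers who spent more than 100.
big_spenders = [row for row in rows if row["spend"] > 100]

# Export: serialise the subset to JSON text.
exported = json.dumps(big_spenders)
print(exported)
```

Note that `csv` reads every field as a string, so type conversion is an explicit step here.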
 
Data Manipulation – cleansing – Munging using Python modules
•Cleansing Data with Python
•Data Manipulation steps (Sorting, filtering, duplicates, merging, appending, subsetting, derived variables, sampling, Data type conversions, renaming, formatting, etc.)
•Data manipulation tools (Operators, Functions, Packages, control structures, Loops, arrays, etc.)
•Python Built-in Functions (Text, numeric, date, utility functions)
•Python User Defined Functions
•Stripping out extraneous information
•Normalizing data
•Formatting data
•Important Python modules for data manipulation (Pandas, NumPy, re, math, string, datetime, etc.)

Data Analysis – Visualization using Python
•Introduction to exploratory data analysis
•Descriptive statistics, Frequency Tables and summarization
•Univariate Analysis (Distribution of data & Graphical Analysis)
•Bivariate Analysis(Cross Tabs, Distributions & Relationships, Graphical Analysis)
 
•Creating Graphs (Bar/pie/line chart/histogram/boxplot/scatter/density, etc.)
•Important Packages for Exploratory Analysis (NumPy Arrays, Matplotlib, seaborn, Pandas, scipy.stats, etc.)

Basic statistics & implementation of stats methods in python
•Basic Statistics – Measures of Central Tendencies and Variance
•Building blocks – Probability Distributions – Normal distribution – Central Limit Theorem
•Inferential Statistics -Sampling – Concept of Hypothesis Testing
•Statistical Methods – Z/t-tests (One sample, independent, paired), Anova, Correlation and Chi-square
•Important modules for statistical methods: Numpy, Scipy, Pandas
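To make the one-sample t-test in the list above concrete, here is a minimal sketch that computes only the test statistic, using the standard library. The function name `one_sample_t` and the sample values are assumptions made for illustration.

```python
import math
import statistics

def one_sample_t(sample, mu0):
    """Return the t statistic for the null hypothesis: population mean == mu0."""
    n = len(sample)
    mean = statistics.mean(sample)
    sd = statistics.stdev(sample)  # sample standard deviation (n - 1 denominator)
    return (mean - mu0) / (sd / math.sqrt(n))

# Hypothetical measurements, tested against a hypothesised mean of 4.
sample = [2, 4, 4, 4, 5, 5, 7, 9]
t_stat = one_sample_t(sample, mu0=4)
print(round(t_stat, 4))
```

In practice `scipy.stats.ttest_1samp` returns the same statistic together with a p-value, so a hand-rolled version like this is mainly useful for understanding the formula.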

Machine Learning -Predictive Modeling – Basics 
•Introduction to Machine Learning & Predictive Modeling
•Types of Business problems – Mapping of Techniques – Regression vs. classification vs. segmentation vs. Forecasting
•Major Classes of Learning Algorithms -Supervised vs Unsupervised Learning
•Different Phases of Predictive Modeling (Data Pre-processing, Sampling, Model Building, Validation)
•Overfitting (Bias-Variance Trade off) & Performance Metrics
•Feature engineering & dimension reduction
•Concept of optimization & cost function
•Overview of gradient descent algorithm
•Overview of Cross validation (Bootstrapping, K-Fold validation, etc.)
•Model performance metrics (R-square, Adjusted R-square, RMSE, MAPE, AUC, ROC curve, recall, precision, sensitivity, specificity, confusion matrix)
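Several of the classification metrics named above follow directly from the four cells of a confusion matrix. The sketch below assumes binary 0/1 labels and that each class and each predicted class occurs at least once (otherwise a division by zero would occur). The function name and the data are hypothetical.

```python
def classification_metrics(actual, predicted):
    """Compute confusion-matrix counts and derived metrics for 0/1 labels."""
    pairs = list(zip(actual, predicted))
    tp = sum(1 for a, p in pairs if a == 1 and p == 1)  # true positives
    tn = sum(1 for a, p in pairs if a == 0 and p == 0)  # true negatives
    fp = sum(1 for a, p in pairs if a == 0 and p == 1)  # false positives
    fn = sum(1 for a, p in pairs if a == 1 and p == 0)  # false negatives
    return {
        "confusion": {"tp": tp, "tn": tn, "fp": fp, "fn": fn},
        "precision": tp / (tp + fp),
        "recall": tp / (tp + fn),        # also called sensitivity
        "specificity": tn / (tn + fp),
        "accuracy": (tp + tn) / len(pairs),
    }

# Hypothetical labels for ten observations.
actual = [1, 1, 1, 1, 0, 0, 0, 0, 0, 0]
predicted = [1, 1, 1, 0, 1, 0, 0, 0, 0, 0]
metrics = classification_metrics(actual, predicted)
print(metrics)
```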
 
Data Exploration for modeling
•Need for structured exploratory data analysis
•EDA framework for exploring the data and identifying any problems with the data (Data Audit Report)
•Identify missing data
•Identify outliers
•Visualize the data trends and patterns
 
Data Preparation
•Need of Data preparation
•Consolidation/Aggregation – Outlier treatment – Flat Liners – Missing values- Dummy creation – Variable Reduction
•Variable Reduction Techniques – Factor & PCA Analysis

Linear Regression: Solving regression problems
•Introduction – Applications
•Assumptions of Linear Regression
•Building Linear Regression Model
•Understanding standard metrics (Variable significance, R-square/Adjusted R-square, Global hypothesis, etc.)
•Assess the overall effectiveness of the model
•Validation of Models (Re running Vs. Scoring)
•Standard Business Outputs (Decile Analysis, Error distribution (histogram), Model equation, drivers etc.)
•Interpretation of Results – Business Validation – Implementation on new data
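As an illustration of building a linear regression model and reading its R-square, here is a minimal sketch for a single predictor using the closed-form least-squares formulas. The function names and data points are invented for this example. A course project would more likely use statsmodels or scikit-learn.

```python
import statistics

def fit_simple_ols(x, y):
    """Return (intercept, slope) of the least-squares line through (x, y)."""
    mean_x = statistics.mean(x)
    mean_y = statistics.mean(y)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return intercept, slope

def r_squared(x, y, intercept, slope):
    """Share of the variance in y explained by the fitted line."""
    mean_y = statistics.mean(y)
    ss_res = sum((yi - (intercept + slope * xi)) ** 2 for xi, yi in zip(x, y))
    ss_tot = sum((yi - mean_y) ** 2 for yi in y)
    return 1 - ss_res / ss_tot

# Hypothetical data lying close to the line y = 2x.
x = [1, 2, 3, 4, 5]
y = [2.1, 3.9, 6.2, 7.8, 10.1]
intercept, slope = fit_simple_ols(x, y)
r2 = r_squared(x, y, intercept, slope)
print(round(intercept, 3), round(slope, 3), round(r2, 4))
```

The fitted pair (intercept, slope) is the "model equation" in the sense used above, and scoring new data means evaluating `intercept + slope * x_new`.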

Logistic Regression: Solving classification problems
•Introduction – Applications
•Linear Regression Vs. Logistic Regression Vs. Generalized Linear Models
•Building Logistic Regression Model (Binary Logistic Model)
•Understanding standard model metrics (Concordance, Variable significance, Hosmer-Lemeshow Test, Gini, KS, Misclassification, ROC Curve, etc.)
•Validation of Logistic Regression Models (Re running Vs. Scoring)
•Standard Business Outputs (Decile Analysis, ROC Curve, Probability Cut-offs, Lift charts, Model equation, Drivers or variable importance, etc)
•Interpretation of Results – Business Validation – Implementation on new data

Segmentation: Solving segmentation problems
•Introduction to Segmentation
•Types of Segmentation (Subjective Vs Objective, Heuristic Vs. Statistical)
•Heuristic Segmentation Techniques (Value Based, RFM Segmentation and Life Stage Segmentation)
•Behavioural Segmentation Techniques (K-Means Cluster Analysis, DBSCAN)
•Cluster evaluation and profiling – Identify cluster characteristics
•Interpretation of results – Implementation on new data
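The K-Means technique listed above can be sketched in a few lines of plain Python. This is a simplified version of Lloyd's algorithm for 2-D points with a fixed number of iterations and caller-supplied starting centroids. The function name, the points and the starting centroids are assumptions for illustration only.

```python
def kmeans(points, centroids, iterations=10):
    """Simplified Lloyd's algorithm for 2-D points from given starting centroids."""
    clusters = []
    for _ in range(iterations):
        # Assignment step: put each point in the cluster of its nearest centroid.
        clusters = [[] for _ in centroids]
        for p in points:
            distances = [(p[0] - c[0]) ** 2 + (p[1] - c[1]) ** 2 for c in centroids]
            clusters[distances.index(min(distances))].append(p)
        # Update step: move each centroid to the mean of its cluster
        # (an empty cluster keeps its previous centroid).
        centroids = [
            (sum(p[0] for p in cl) / len(cl), sum(p[1] for p in cl) / len(cl)) if cl else c
            for cl, c in zip(clusters, centroids)
        ]
    return centroids, clusters

# Hypothetical customers described by two numeric features.
points = [(1, 1), (1, 2), (2, 1), (8, 8), (9, 8), (8, 9)]
centroids, clusters = kmeans(points, centroids=[(1, 1), (8, 8)])
print(centroids)
print(clusters)
```

Profiling each cluster, as in the bullet on cluster evaluation, then amounts to summarising the points in each entry of `clusters`.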

Time Series Forecasting: Solving forecasting problems
•Introduction – Applications
•Time Series Components (Trend, Seasonality, Cyclicity and Level) and Decomposition
•Classification of Time Series Techniques (Pattern based – Pattern less)
•Basic Techniques – Averages, Smoothing, etc.
•Advanced Techniques – AR Models, ARIMA, etc
•Understanding Forecasting Accuracy – MAPE, MAD, MSE, etc
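To show how a basic averaging technique and the accuracy measures above fit together, here is a minimal sketch of a moving-average forecast scored with MAD, MSE and MAPE. The function names and the `sales` series are hypothetical, and MAPE as written assumes no actual value is zero.

```python
def moving_average_forecast(series, window):
    """Forecast each point as the mean of the previous `window` observations."""
    return [sum(series[i - window:i]) / window for i in range(window, len(series))]

def forecast_accuracy(actual, forecast):
    """Return MAD, MSE and MAPE (in percent) for paired actual/forecast values."""
    errors = [a - f for a, f in zip(actual, forecast)]
    n = len(errors)
    return {
        "MAD": sum(abs(e) for e in errors) / n,
        "MSE": sum(e * e for e in errors) / n,
        "MAPE": 100 * sum(abs(e) / abs(a) for e, a in zip(errors, actual)) / n,
    }

# Hypothetical monthly sales with a steady upward trend.
sales = [100, 110, 120, 130, 140, 150]
forecast = moving_average_forecast(sales, window=2)
accuracy = forecast_accuracy(sales[2:], forecast)
print(forecast)
print(accuracy)
```

Because the series trends upward, the moving average lags behind it by a constant amount, which is why every error in this example has the same size.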

Supervised Learning: Decision Trees
•Decision Trees – Introduction – Applications
•Types of Decision Tree Algorithms
•Construction of Decision Trees through Simplified Examples; Choosing the “Best” attribute at each Non-Leaf node; Entropy; Information Gain, Gini Index, Chi Square, Regression Trees
•Generalizing Decision Trees; Information Content and Gain Ratio; Dealing with Numerical Variables; other Measures of Randomness
•Pruning a Decision Tree; Cost as a consideration; Unwrapping Trees as Rules
•Decision Trees – Validation
•Overfitting – Best Practices to avoid
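Entropy and information gain, used above to choose the "best" attribute at a node, can be computed as in the following minimal sketch. The function names, labels and candidate split are assumptions invented for illustration.

```python
import math
from collections import Counter

def entropy(labels):
    """Shannon entropy (in bits) of a list of class labels."""
    total = len(labels)
    return -sum((c / total) * math.log2(c / total) for c in Counter(labels).values())

def information_gain(labels, groups):
    """Entropy of the parent node minus the weighted entropy of the child groups."""
    total = len(labels)
    weighted = sum(len(g) / total * entropy(g) for g in groups)
    return entropy(labels) - weighted

# Hypothetical node with 5 "yes" and 5 "no" cases, and one candidate split.
labels = ["yes"] * 5 + ["no"] * 5
split = [["yes"] * 4 + ["no"], ["yes"] + ["no"] * 4]
gain = information_gain(labels, split)
print(round(entropy(labels), 4), round(gain, 4))
```

A tree builder would evaluate this gain for every candidate attribute and split on the one with the largest value.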

Supervised Learning: Ensemble Learning
•Concept of Ensembling
•Manual Ensembling Vs. Automated Ensembling
•Methods of Ensembling (Stacking, Mixture of Experts)
•Bagging (Logic, Practical Applications)
•Random forest (Logic, Practical Applications)
•Boosting (Logic, Practical Applications)
•Ada Boost
•Gradient Boosting Machines (GBM)
•XGBoost

Supervised Learning: Artificial Neural Networks (ANN)
•Motivation for Neural Networks and Its Applications
•Perceptron and Single Layer Neural Network, and Hand Calculations
•Learning in a Multi-Layered Neural Net: Back Propagation and Conjugate Gradient Techniques
•Neural Networks for Regression
•Neural Networks for Classification
•Interpretation of Outputs and Fine tune the models with hyper parameters
•Validating ANN models

Supervised Learning: Support Vector Machines
•Motivation for Support Vector Machine & Applications
•Support Vector Regression
•Support vector classifier (Linear & Non-Linear)
•Mathematical Intuition (Kernel Methods Revisited, Quadratic Optimization and Soft Constraints)
•Interpretation of Outputs and Fine tune the models with hyper parameters
•Validating SVM models

Supervised Learning: KNN
•What is KNN & Applications?
•KNN for missing value treatment
•KNN For solving regression problems
•KNN for solving classification problems
•Validating KNN model
•Model fine tuning with hyper parameters
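KNN for classification, as listed above, reduces to "find the k closest training points and take a majority vote". The sketch below assumes Python 3.8 or later (for `math.dist`). The function name, training points and labels are hypothetical.

```python
import math
from collections import Counter

def knn_predict(train, query, k=3):
    """Classify `query` by majority vote among its k nearest training points.

    `train` is a list of (point, label) pairs; distance is Euclidean.
    """
    neighbours = sorted(train, key=lambda item: math.dist(item[0], query))[:k]
    votes = Counter(label for _, label in neighbours)
    return votes.most_common(1)[0][0]

# Hypothetical labelled points in two dimensions.
train = [
    ((1, 1), "low"), ((1, 2), "low"), ((2, 2), "low"),
    ((6, 6), "high"), ((7, 6), "high"), ((6, 7), "high"),
]
print(knn_predict(train, (2, 1)))
print(knn_predict(train, (6, 5)))
```

The value of `k` is the main hyperparameter to tune. For regression, the vote would be replaced by the mean of the neighbours' target values.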

Supervised Learning: Naïve Bayes
•Concept of Conditional Probability
•Bayes Theorem and Its Applications
•Naïve Bayes for classification
•Applications of Naïve Bayes in Classifications
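Bayes' theorem, which underlies the Naïve Bayes classifier above, can be worked through numerically. The probabilities below are made-up figures for a spam example, and the function name is an assumption.

```python
def posterior(prior, likelihood, likelihood_not):
    """P(H | E) from P(H), P(E | H) and P(E | not H), via Bayes' theorem."""
    evidence = likelihood * prior + likelihood_not * (1 - prior)
    return likelihood * prior / evidence

# Hypothetical figures: 20% of mail is spam; a given word appears in
# 60% of spam messages and in 5% of non-spam messages.
p_spam_given_word = posterior(prior=0.2, likelihood=0.6, likelihood_not=0.05)
print(round(p_spam_given_word, 2))
```

Naïve Bayes applies the same update for many words at once, under the simplifying assumption that the words are conditionally independent given the class.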

Text Mining & Analytics
•Taming big text, Unstructured vs. Semi-structured Data; Fundamentals of information retrieval; Properties of words; Creating Term-Document (TxD) Matrices; Similarity measures; Low-level processes (Sentence Splitting; Tokenization; Part-of-Speech Tagging; Stemming; Chunking)
•Finding patterns in text: text mining, text as a graph
•Natural Language processing (NLP)
•Text Analytics – Sentiment Analysis using Python
•Text Analytics – Word cloud analysis using Python
•Text Analytics –  Segmentation using K-Means/Hierarchical Clustering
•Text Analytics –  Classification (Spam/Not spam)
•Applications of Social Media Analytics
•Metrics (Measures, Actions) in social media analytics
•Examples & Actionable Insights using Social Media Analytics
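The term-document matrix and similarity measures mentioned earlier in this section can be sketched as follows. Tokenization here is a plain whitespace split, and the three documents are invented for the example. A real project would more likely use nltk or scikit-learn for these steps.

```python
import math
from collections import Counter

# Hypothetical mini-corpus of three short messages.
docs = [
    "free offer win cash now",
    "win free cash prize",
    "meeting agenda for monday",
]

# Tokenize, then build the vocabulary and a count matrix
# (one row per document, one column per term).
tokenized = [doc.split() for doc in docs]
vocab = sorted({term for doc in tokenized for term in doc})
matrix = [[Counter(doc)[term] for term in vocab] for doc in tokenized]

def cosine(u, v):
    """Cosine similarity between two non-zero count vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm

print(round(cosine(matrix[0], matrix[1]), 4))
print(cosine(matrix[0], matrix[2]))
```

The first two messages share three terms and score well above zero, while the third shares none with the first and scores zero. Clustering or spam classification would operate on the rows of `matrix`.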

 

•Important Python modules for Machine Learning (scikit-learn, statsmodels, SciPy, NLTK, etc.)
•Fine-tuning the models using hyperparameters, grid search, pipelines, etc.

 

•Project – Consolidate Learnings:
      •Applying different algorithms to solve the business problems and benchmark the results

 


468 STUDENTS ENROLLED


    Training in Cities

    Mumbai,Bangalore, Hyderabad, Chennai, Delhi, Kolkata, UK, London, Chicago, San Francisco, Dallas, Washington, New York, Orlando, Boston, Sydney, Singapore
    Disclaimer

    CFA Institute does not endorse, promote, or warrant the accuracy or quality of the products or services offered by Courses Mojo. CFA Institute, CFA® Program, and Chartered Financial Analyst® are trademarks owned by CFA Institute.

    GARP does not endorse, promote, review or warrant the accuracy of the products or services offered by CoursesMojo of GARP Exam related information, nor does it endorse any pass rates that may be claimed by the Exam Prep Provider. Further, GARP is not responsible for any fees or costs paid by the user to Courses Mojo nor is GARP responsible for any fees or costs of any person or entity providing any services to Courses Mojo. ERP®, FRM®, GARP® and Global Association of Risk Professionals™ are trademarks owned by the Global Association of Risk Professionals, Inc.
