Data Mining and Analysis

Course Code



28 hours (usually 4 days including breaks)



Delegates be able to analyse big data sets, extract patterns, choose the right variable impacting the results so that a new model is forecasted with predictive results.

Course Outline

  1. Data preprocessing

    1. Data Cleaning
    2. Data integration and transformation
    3. Data reduction
    4. Discretization and concept hierarchy generation
  2. Statistical inference

    1. Probability distributions, Random variables, Central limit theorem
    2. Sampling
    3. Confidence intervals
    4. Statistical Inference
    5. Hypothesis testing
  3. Multivariate linear regression

    1. Specification
    2. Subset selection
    3. Estimation
    4. Validation
    5. Prediction
  4. Classification methods

    1. Logistic regression
    2. Linear discriminant analysis
    3. K-nearest neighbours
    4. Naive Bayes
    5. Comparison of Classification methods
  5. Neural Networks

    1. Fitting neural networks
    2. Training neural networks issues
  6. Decision trees

    1. Regression trees
    2. Classification trees
    3. Trees Versus Linear Models
  7. Bagging, Random Forests, Boosting

    1. Bagging
    2. Random Forests
    3. Boosting
  8. Support Vector Machines and Flexible disct

    1. Maximal Margin classifier
    2. Support vector classifiers
    3. Support vector machines
    4. 2 and more classes SVM’s
    5. Relationship to logistic regression
  9. Principal Components Analysis

  10. Clustering

    1. K-means clustering
    2. K-medoids clustering
    3. Hierarchical clustering
    4. Density based clustering
  11. Model Assesment and Selection

    1. Bias, Variance and Model complexity
    2. In-sample prediction error
    3. The Bayesian approach
    4. Cross-validation
    5. Bootstrap methods

Client Testimonials


Bookings, Prices and Enquiries

Private Classroom

From 8000EUR

Private Remote

From 7000EUR (9)

Public Classroom

Cannot find a suitable date? Choose Your Course Date >>Too expensive? Suggest your price

Course Discounts

Course Venue Course Date Course Price [Remote / Classroom]
Introduction to R Kaunas, City Center Wed, 2018-07-04 09:30 4725EUR / 5525EUR
Introduction to Recommendation Systems Vaduz, Oberland Tue, 2018-07-31 09:30 1350EUR / 1750EUR
One Day Workshop for PEAP Authentication of Windows 7 Supplicant using a Cisco Switch as Authenticator and Windows 2008 R2 Server Kaunas, City Center Wed, 2018-09-26 09:30 1350EUR / 1750EUR
MS-20487 Developing Windows Azure and Web Services MTA Exam 70-487 Kaunas, City Center Mon, 2018-10-01 09:30 6750EUR / 7950EUR
jBPM for Developers Vaduz, Oberland Mon, 2018-10-15 09:30 7875EUR / 9075EUR

Course Discounts Newsletter

We respect the privacy of your email address. We will not pass on or sell your address to others.
You can always change your preferences or unsubscribe completely.