SVBook Certified Data Miner using Python


With over 2.5 quintillion bytes of data generated every day, businesses are trying to get insights form it. Data Science and Data Analytics have become important across industries. Data science and data analytics have became the most sought after and promising skills in future.

Our SVBook Certified Data Miner using Python uses CRISP DM data mining model and will go through Data Understanding (Descriptive Statistics, Regressions, Inferential Statistics, Data Visualizations), Data Preparation (Remove Duplicates), Modeling (Neural Networks, regression, KNN), Evaluation (10 folds Cross Validations), and Deployment (Slides, Reports, R Shiny Web apps).




Extracted from: https://www.pcmag.com/news/90-percent-of-the-big-data-we-generate-is-an-unstructured-mess

Data Science Certification Advantages


Experience cutting-edge online learning from world-class instructors through this Data Science certification course.


Data Science Course Details


This market-driven Data Science course covers the specialized skills required in the field of Data Science.

Course Curriculum

8 Weeks

  • Create Your Calculator: Learn Python Programming Basics Fast (Python Basics)
  • Applied Statistics using Python with Data Processing (Data Understanding and Data Preparation)
  • Advanced Data Visualizations using Python with Data Processing (Data Understanding and Data Preparation)
  • Machine Learning with Python (Modeling and Evaluation)

Skills Covered


Experience cutting-edge online learning from world-class instructors through this Data Science certification course.


  • Data Science
  • Data Analytics
  • Data Visualizations
  • Linear Regression
  • KNN
  • Naive Bayes
  • Exploratory Data Analysis
  • Data Preparation
  • Predictive Modeling

Tools Covered

Experience cutting-edge online learning from world-class instructors through this Data Science certification course.

Contact Us



About The Instructor...






Mr. Eric M. H. Goh
Eric Goh is a data scientist, software engineer, adjunct faculty and entrepreneur with years of experiences in multiple industries. His varied career includes data science, data and text mining, natural language processing, machine learning, intelligent system development, and engineering product design. He founded SVBook sole proprietorship in 2016 and reregister to SVBook Pte. Ltd. in 2018, because of contract requirements, and extended it with DSTK.Tech and EMHAcademy.com. DSTK.Tech is where Eric develops his own DSTK data science softwares (public version) and uploaded at sourceforge and github. Eric also published “Learn R for Applied Statistics” at Apress, and published some books at LeanPub and SVBook Pte. Ltd. He teaches the content at Udemy and EMHAcademy.com, and developed 28 courses, 1 E-Diploma, 7 advanced certificates. Eric is also an adjunct faculty at Universities and Institutions, which is a consultancy from EMHAcademy.com.

Eric Goh has been leading his teams for various industrial projects, including the advanced product code classification system project which automates Singapore Custom’s trade facilitation process, and Nanyang Technological University's data science and ranking projects where he develop his own DSTK data science software (NTU version) for QS ranking project, JATI for QS mobile apps screenshots to text project and JAVT for Convocation videos to text project. Eric wrote some guides to use these softwares for NTU projects - DSTK, JAVT, JATI . While in NTU, from 2015 to 2018, NTU ranking did not fall even when NUS ranking fall - First Year, Second Year, Third Year. In second year, NTU data increases, hence, research and development of DSTK (NTU Version) software, and management reduced the data for third year. DSTK (public version) is at DSTK.Tech, DSTK is developed in 2017 and is similar to the QuillEdit software developed in 2006. He has years of experience in C#, Java, C/C++, SPSS Statistics and Modeller, SAS Enterprise Miner, R, Python, Excel, Excel VBA and etc. He won Tan Kah Kee Young Inventors' Merit Award and Shortlisted Entry for TelR Data Mining Challenge.

Eric holds a Masters of Technology degree in Knowledge Engineering (Machine Learning) from the National University of Singapore (NUS) (download opencert file and put here. What is OpenCert.... ) in 2013. Eric also possessed Executive Master of Business Administration (MBA) degree (click here) from IGNOU (http://ignou.ac.in) and Executive Certificate in Global IT Management from U21Global (currently GlobalNxt) in 2012, where he's delayed and left exams while studying in NUS. He has a Graduate Diploma in Mechatronics from A*STAR SIMTech (a national research institute located in Nanyang Technological University) in 2011, which he completed while working in SUTD. He has Coursera Specialization Certificate in Business Statistics and Analysis (Excel) from Rice University in 2017, IBM Data Science Professional Certificate (Python, SQL) in 2018, and Coursera Verified Certificate in R Programming from Johns Hopkins University in 2017, More Data Mining with Weka Certificate from University of Waikato, Coursera Verified Certificate in Data Visualization and Communication with Tableau from Duke University in 2016, Coursera Verified Certificate in Internet of Things and Augmented Reality Emerging Technologies from Yonsei University in 2017. Eric continuous upgrade himself using Coursera courses after 2013, which are University courses. He possessed a Bachelor of Science degree in Computing from the University of Portsmouth in 2010, which he completed while in National Service (2008 to 2010). He holds a Diploma with Merit in Electronics and Computer Engineering from Ngee Ann Polytechnic in 2008. He is also an AIIM Certified Business Process Management Master (BPMM) in 2011, GSTF certified Big Data Science Analyst (CBDSA), IES Certified Lecturer in 2016, and holds the Coursera Verified Certificate in University Teaching from The University of Hong Kong in 2019, and University of Wisconsin-Madison Fundamentals of Online Teaching certificate in 2016.

More...


Request Information