Exams 70-773 and 70-774 toward an MCSA
Overview
Here are links to materials I’ve come across for learning enough to pass Microsoft’s Azure Machine Learning certification exam 70-774
- 37 questions in 90 minutes at a VUE center for $165.
This is one way to demonstrate your expertise in making use of machine learning and Big Data with R Server and SQL R Services within the Microsoft Azure cloud, that you have a good understanding of Azure data services and are familiar with common data science processes such as filtering and transforming data sets, model estimation, and model evaluation. Candidates for this exam should have experience publishing effective APIs for knowledge intelligence.
Exam Reference Book
The 70-7734 exam began beta on Feb 17, 2017. The 336 page reference book for the exam was published Feb 2018. It is $32 from Microsoft, but $17.27 on Kindle.
PROTIP: In September 2017 “Studio” was added to the name “Azure Machine Learning” when referring to the web tool. This fixes a confusion in terminology between a tool versus a process. References in this website reflect that change.
The authors - Ginger Grant, Julio Granados, Guillermo Fernández, Pau Sempere, Javier Torrenteras also wrote the book on 70-773.
MCSA Bundle
Since Machine Learning often involves processing a lot of data, passing this exam as well as “Exam 70-773: Analyzing Big Data with Microsoft R” 70-773 gets you an MCSA: “Microsoft Certified Solutions Associate: Machine Learning certification”.
This exam set is referenced by vendors who offer bundles such as:
- PDF & Practice Exam from CertificationGenie
- https://www.netcomlearning.com/certifications/610/MCSA-Machine-Learning-training.html
- https://www.pass4itsure.com/70-774.html
Exam Topics - Top-level
A. Prepare data for analysis in Azure Machine Learning Studio and export from the Studio
B. Develop machine learning models
C. Operationalize and manage Azure Machine Learning services
D. Use other services for machine learning
Objectives
This provides a deeper list:
A. Prepare Data and Analytics in Azure Machine Learning Studio and Export from Azure Machine Learning Studio
- Import and export data to and from Azure Machine Learning Studio</a>
- Explore and summarize data
- Cleanse data for Azure Machine Learning
- Perform feature engineering
B. Develop Machine Learning Models
C. Operationalize and Manage Azure Machine Learning services
- Deploy models using Azure Machine Learning Studio
- Manage Azure Machine Learning projects and workspaces
- Consume Azure Machine Learning models
- Consume exemplar Cognitive Services APIs
D. Use Other Services for Machine Learning
- Build and use neural networks with the Microsoft Cognitive Toolkit
- Streamline development by using existing resources
- Perform data sciences at scale by using HDInsights
- Perform database analytics by using SQL Server R Services on Azure
There is considerable overlap of material with SQL and big data subject matter in the 70-774 exam.
Microsoft Classes
The team behind http://learnanalytics.microsoft.com/home/index has a GitHub.io page on 70-774 and 70-773 which mentions courses at EdX.com:
-
https://www.edx.org/course/principles-of-machine-learning DAT203.2x 6 x 4 (24 hour) hands-on class (by the brilliant Cythina Rudin of the Prediction Analytics Lab at Duke and Steve Elston of Quantia Analytics) covers classification, regression, models, and tree ensemble models, recommenders from clustering. Example code files are provided in both R and Python.
-
https://www.edx.org/course/applied-machine-learning-microsoft-dat203-3x-4 by Graeme Malcolm plus the authors listed above. This was not available at time of this writing May 2018. The class covers of time series and forecasting, spatial data analysis, text analytics, and analysis of images.
-
Content Developer Graeme Malcolm also narrated at the “DAT203.1x: Data Science Essentials” class at Edx.
Introduction to Machine Learning with Hands-On Labs dated June 15, 2016 provides a series on
- Introduction to AMLS - Lab Setup
https://github.com/Azure-Readiness/hol-azure-machine-learning
Tasks
Microsoft’s official reference page on the exam lists the tasks being tested.
PROTIP: Save this page and check off what you’re able to do as you learn each one.
A. Prepare Data for Analysis in Azure Machine Learning and Export from Azure Machine Learning
Import and export data to and from Azure Machine Learning
- [_] Import and export data to and from Azure Blob storage
- [_] import and export data to and from Azure SQL Database
- [_] import and export data via Hive Queries
- [_] import data from a website
- [_] import data from on-premises SQL
Explore and summarize data
- [_] Create univariate summaries
- [_] create multivariate summaries
- [_] visualize univariate distributions
- [_] use existing Microsoft R or Python notebooks for custom summaries and custom visualizations
- [_] use zip archives to import external packages for R or Python
- [_] Cleanse data for Azure Machine Learning
- [_] Apply filters to limit a dataset to the desired rows
- [_] identify and address missing data
- [_] identify and address outliers
- [_] remove columns and rows of datasets
Perform feature engineering
- [_] Merge multiple datasets by rows or columns into a single dataset by columns
- [_] merge multiple datasets by rows or columns into a single dataset by rows
- [_] add columns that are combinations of other columns
- [_] manually select and construct features for model estimation
- [_] automatically select and construct features for model estimation
- [_] reduce dimensions of data through principal component analysis (PCA)
- [_] manage variable metadata
- [_] select standardized variables based on planned analysis
B. Develop Machine Learning Models
Select an appropriate algorithm or method
- [_] Select an appropriate algorithm for predicting continuous label data
- [_] select an appropriate algorithm for supervised versus unsupervised scenarios
- [_] identify when to select R versus Python notebooks
- [_] identify an appropriate algorithm for grouping unlabeled data
- [_] identify an appropriate algorithm for classifying label data
- [_] select an appropriate ensemble
Initialize and train appropriate models
- [_] Tune hyperparameters manually
- [_] tune hyperparameters automatically
- [_] split data into training and testing datasets, including using routines for cross-validation
- [_] build an ensemble using the stacking method
Validate models
- [_] Score and evaluate models,
- [_] select appropriate evaluation metrics for clustering,
- [_] select appropriate evaluation metrics for classification,
- [_] select appropriate evaluation metrics for regression,
- [_] use evaluation metrics to choose between Machine Learning models,
- [_] compare ensemble metrics against base models
C. Operationalize and Manage Azure Machine Learning Services
Deploy models using Azure Machine Learning
- [_] Publish a model developed inside Azure Machine Learning
- [_] publish an externally developed scoring function using an Azure Machine Learning package
- [_] use web service parameters,
- [_] create and publish a recommendation model
- [_] create and publish a language understanding model
Manage Azure Machine Learning projects and workspaces
- [_] Create projects and experiments
- [_] add assets to a project
- [_] create new workspaces
- [_] invite users to a workspace, switch between different workspaces
- [_] create a Jupyter notebook that references an intermediate dataset
Consume Azure Machine Learning models
- [_] Connect to a published Machine Learning web service
- [_] consume a published Machine Learning model programmatically using a batch execution service
- [_] consume a published Machine Learning model programmatically using a request response service
- [_] interact with a published Machine Learning model using Microsoft Excel
- [_] publish models to the marketplace
Consume exemplar Cognitive Services APIs
- [_] Consume Vision APIs to process images
- [_] consume Language APIs to process text
- [_] consume Knowledge APIs to create recommendations
D. Use Other Services for Machine Learning
Build and use neural networks with the Microsoft Cognitive Toolkit (CNTK)
- [_] Use N-series VMs for GPU acceleration
- [_] build and train a three-layer feed forward neural network
- [_] determine when to implement a neural network
Streamline development by using existing resources
- [_] Clone template experiments from Cortana Intelligence Gallery
- [_] use Cortana Intelligence Quick Start to deploy resources
- [_] use a data science VM for streamlined development
Perform data sciences at scale by using HDInsights
- [_] Deploy the appropriate type of HDI cluster
- [_] perform exploratory data analysis by using Spark SQL
- [_] build and use Machine Learning models with Spark on HDI
- [_] build and use Machine Learning models using MapReduce
- [_] build and use Machine Learning models using Microsoft R Server
Perform database analytics by using SQL Server R Services on Azure
- [_] Deploy a SQL Server 2016 Azure VM
- [_] configure SQL Server to allow execution of R scripts
- [_] execute R scripts inside T-SQL statements
Study materials
I’ve rearranged Daniel Calbimonte’s massive list and others into my sequence below.
Microsoft Azure Essentials: Azure Machine Learning by Jeff Barnes is a free ebook from 2016.
HDInsight clusters
Introducing Microsoft Azure HDInsight on the Azure Portal by Avkash Chauhan, Valentive Fontama, and 3 others</li>
Premium Video Courses
- Introduction to AzureML by Richard Conway
- Real-Time Ingestion and Processing in Azure by Chris Pietschmann
- Getting Started with Azure Machine Learning by Jerry Kurt
- HDInsight Deep Dive: Storm, HBase, and Hive by Elton Stoneman
Videos
Follow this machine learning tutorial to use Azure Machine Learning Studio to create a linear regression model that predicts the price of an automobile based on different variables such as make and technical specifications. Then iterate on a simple predictive analytics experiment.
VIDEO: Hands-On with Azure Machine Learning
Machine Learning Algorthms - Part 1
this video: https://studio.azureml.net/?selectAccess=true&o=2
More
This is one of a series on AI, Machine Learning, Deep Learning, Robotics, and Analytics:
- AI Ecosystem
- Machine Learning
- Microsoft’s AI
- Microsoft’s Azure Machine Learning Algorithms
- Microsoft’s Azure Machine Learning tutorial
- Python installation
- Image Processing
- Tessaract OCR using OpenCV
- Multiple Regression calculation and visualization using Excel and Machine Learning
- Tableau Data Visualization