Machine Learning. What is Machine Learning?


What is Machine Learning?

Programs that get better with experience, given a task and some performance measure:
- Learning to classify news articles
- Learning to recognize spoken words
- Learning to play board games
- Learning to navigate (e.g. self-driving cars)

Usually involves some sort of inductive reasoning step.

Inductive Reasoning

- Deductive reasoning (rule-based reasoning): from the general to the specific
- Inductive reasoning: from the specific to the general

General Theory --(deduction)--> Specific Facts
Specific Facts --(induction)--> General Theory

Note: not to be confused with mathematical induction!

Example

- Facts: every time you see a swan, you notice that the swan is white.
- Inductive step: you infer that all swans are white.

Observed swans are white --(induction)--> All swans are white.

Inference is the act or process of drawing a conclusion based solely on what one already knows.

Observation

- Deduction is truth preserving. If the rules employed in the deductive reasoning process are sound, then what holds in the theory will hold for the deduced facts.
- Induction is NOT truth preserving. It is more of a statistical argument: the more swans you see that are white, the more probable it is that all swans are white. But this does not exclude the existence of black swans.

Observation

[Figure: D = the observations; X = the universe of all swans]

Different Styles of Machine Learning

- Supervised Learning: the learner needs explicit examples of the concept to be learned (e.g. white swans, playing tennis, etc.)
- Unsupervised Learning: the learner autonomously discovers any structure in a domain that might represent an interesting concept

Knowledge - Representing what has been learned

Symbolic Learners (transparent models)
- If-then-else rules
- Decision trees
- Association rules

Sub-Symbolic Learners (non-transparent models)
- (Deep) Neural Networks
- Clustering (Self-Organizing Maps, k-means)
- Support Vector Machines

Decision Trees

- Learn from labeled observations (supervised learning)
- Represent the knowledge learned in the form of a tree

Example: learning when to play tennis. The examples/observations are days with their observed characteristics and whether we played tennis or not.

Play Tennis Example

Outlook   Temperature  Humidity  Windy  PlayTennis
Sunny     Hot          High      False  No
Sunny     Hot          High      True   No
Overcast  Hot          High      False  Yes
Rainy     Mild         High      False  Yes
Rainy     Cool         Normal    False  Yes
Rainy     Cool         Normal    True   No
Overcast  Cool         Normal    True   Yes
Sunny     Mild         High      False  No
Sunny     Cool         Normal    False  Yes
Rainy     Mild         Normal    False  Yes
Sunny     Mild         Normal    True   Yes
Overcast  Mild         High      True   Yes
Overcast  Hot          Normal    False  Yes
Rainy     Mild         High      True   No

Decision Tree Learning

Facts or Observations --(induction)--> Theory

Interpreting a DT

DT = Decision Tree

- A DT uses the features of an observation table as nodes and the feature values as links.
- All feature values of a particular feature need to be represented as links.
- The target feature is special: its values show up as leaf nodes in the DT.

Interpreting a DT

Each path from the root of the DT to a leaf can be interpreted as a decision rule. Using the feature names and values of the Play Tennis table:

IF Outlook = Sunny AND Humidity = Normal THEN PlayTennis = Yes
IF Outlook = Overcast THEN PlayTennis = Yes
IF Outlook = Rainy AND Windy = True THEN PlayTennis = No
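The path-to-rule reading can be sketched in a few lines of Python. The nested-dictionary layout below is only an assumed representation (inner nodes map a feature to its value links, leaves are PlayTennis values), and the tree written out is the one the Recursive Partitioning slides arrive at; the names TREE and rules are made up for this illustration.

```python
# Assumed representation: {feature: {value: subtree}}; a plain string is a leaf.
TREE = {
    "Outlook": {
        "Sunny": {"Humidity": {"High": "No", "Normal": "Yes"}},
        "Overcast": "Yes",
        "Rainy": {"Windy": {"True": "No", "False": "Yes"}},
    }
}


def rules(tree, conditions=()):
    """Yield one IF-THEN rule for every path from the root to a leaf."""
    if not isinstance(tree, dict):
        # Reached a leaf: the collected conditions form the rule body.
        yield "IF " + " AND ".join(conditions) + " THEN PlayTennis = " + tree
        return
    (feature, branches), = tree.items()  # an inner node tests exactly one feature
    for value, subtree in branches.items():
        yield from rules(subtree, conditions + (f"{feature} = {value}",))


RULES = list(rules(TREE))
for rule in RULES:
    print(rule)
```

The tree has five leaves, so five rules are printed; the three rules listed on the slide are among them.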

DT: Explanation & Prediction

- Explanation: the DT summarizes (explains) all the observations in the table perfectly, i.e. with 100% accuracy.
- Prediction: once we have a DT (or model), we can use it to make predictions on observations that are not in the original training table. Consider:

Outlook = Sunny, Temperature = Mild, Humidity = Normal, Windy = False, PlayTennis = ?
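Both uses can be checked with a small sketch. The function predict below is a hand-coded version of the tree from the Recursive Partitioning slides (Outlook at the root, Humidity under Sunny, Windy under Rainy); the function and variable names are assumptions made for this example, not part of any library.

```python
ROWS = """
Sunny Hot High False No
Sunny Hot High True No
Overcast Hot High False Yes
Rainy Mild High False Yes
Rainy Cool Normal False Yes
Rainy Cool Normal True No
Overcast Cool Normal True Yes
Sunny Mild High False No
Sunny Cool Normal False Yes
Rainy Mild Normal False Yes
Sunny Mild Normal True Yes
Overcast Mild High True Yes
Overcast Hot Normal False Yes
Rainy Mild High True No
"""
FEATURES = ["Outlook", "Temperature", "Humidity", "Windy"]
TABLE = [line.split() for line in ROWS.strip().splitlines()]


def predict(obs):
    """Hand-coded decision tree; obs maps feature names to string values."""
    if obs["Outlook"] == "Overcast":
        return "Yes"
    if obs["Outlook"] == "Sunny":
        return "Yes" if obs["Humidity"] == "Normal" else "No"
    # Remaining case: Outlook == Rainy
    return "No" if obs["Windy"] == "True" else "Yes"


# Explanation: fraction of training rows whose label the tree reproduces.
hits = sum(predict(dict(zip(FEATURES, row[:4]))) == row[4] for row in TABLE)
accuracy = hits / len(TABLE)

# Prediction: the observation from the slide, which is not in the table.
new_obs = {"Outlook": "Sunny", "Temperature": "Mild",
           "Humidity": "Normal", "Windy": "False"}
answer = predict(new_obs)
print(accuracy, answer)
```

The tree reproduces all 14 training labels, and for the new observation it follows the Sunny branch and then the Normal humidity branch, giving PlayTennis = Yes.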

Constructing DTs

How do we choose the attributes and the order in which they appear in a DT?

- Recursive partitioning of the original data table
- Heuristic: each generated partition has to be less random (entropy reduction) than previously generated partitions

Entropy

- $S$ is a sample of training examples
- $p_{+}$ is the proportion of positive examples in $S$
- $p_{-}$ is the proportion of negative examples in $S$
- Entropy measures the impurity (randomness) of $S$

$$\mathrm{Entropy}(S) = -p_{+} \log_2 p_{+} - p_{-} \log_2 p_{-}$$

For the Play Tennis table: $\mathrm{Entropy}(S) = \mathrm{Entropy}([9+, 5-]) = 0.94$

[Figure: Entropy(S) plotted against p+]
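As a quick check of the formula, here is a minimal sketch that computes the entropy from the counts of positive and negative examples. The function name entropy and its two-argument signature are choices made for this example; the convention that a zero count contributes nothing (0 log 0 taken as 0) is the usual one.

```python
from math import log2


def entropy(pos, neg):
    """Entropy of a sample with pos positive and neg negative examples."""
    total = pos + neg
    result = 0.0
    for count in (pos, neg):
        if count:  # a class with no examples contributes 0
            p = count / total
            result -= p * log2(p)
    return result


print(round(entropy(9, 5), 3))  # the [9+, 5-] sample from the slide
print(entropy(7, 7))            # evenly mixed sample
print(entropy(4, 0))            # pure sample
```

The [9+, 5-] sample gives about 0.940, an evenly mixed sample gives the maximum of 1, and a pure sample gives 0.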

Partitioning the Data Set

Splitting the table on Outlook gives three partitions:

Outlook = Sunny (E = .97)
Outlook   Temperature  Humidity  Windy  PlayTennis
Sunny     Hot          High      False  No
Sunny     Hot          High      True   No
Sunny     Mild         High      False  No
Sunny     Cool         Normal    False  Yes
Sunny     Mild         Normal    True   Yes

Outlook = Overcast (E = 0)
Outlook   Temperature  Humidity  Windy  PlayTennis
Overcast  Hot          High      False  Yes
Overcast  Cool         Normal    True   Yes
Overcast  Mild         High      True   Yes
Overcast  Hot          Normal    False  Yes

Outlook = Rainy (E = .97)
Outlook   Temperature  Humidity  Windy  PlayTennis
Rainy     Mild         High      False  Yes
Rainy     Cool         Normal    False  Yes
Rainy     Cool         Normal    True   No
Rainy     Mild         Normal    False  Yes
Rainy     Mild         High      True   No

Average Entropy = .64 (weighted .69)
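The per-partition entropies and the two averages can be recomputed with the sketch below. It assumes that "weighted" means each partition's entropy is weighted by its share of the rows; the helper names are made up for this example.

```python
from collections import Counter, defaultdict
from math import log2

ROWS = """
Sunny Hot High False No
Sunny Hot High True No
Overcast Hot High False Yes
Rainy Mild High False Yes
Rainy Cool Normal False Yes
Rainy Cool Normal True No
Overcast Cool Normal True Yes
Sunny Mild High False No
Sunny Cool Normal False Yes
Rainy Mild Normal False Yes
Sunny Mild Normal True Yes
Overcast Mild High True Yes
Overcast Hot Normal False Yes
Rainy Mild High True No
"""
TABLE = [line.split() for line in ROWS.strip().splitlines()]


def entropy(labels):
    """Entropy of a list of class labels, written as sum of p * log2(1/p)."""
    n = len(labels)
    return sum(c / n * log2(n / c) for c in Counter(labels).values())


# Group the PlayTennis labels (column 4) by Outlook value (column 0).
parts = defaultdict(list)
for row in TABLE:
    parts[row[0]].append(row[4])

per_part = {value: entropy(labels) for value, labels in parts.items()}
unweighted = sum(per_part.values()) / len(per_part)
weighted = sum(len(labels) / len(TABLE) * entropy(labels)
               for labels in parts.values())

for value, e in per_part.items():
    print(value, round(e, 3))
print("average", round(unweighted, 3), "weighted", round(weighted, 3))
```

The results come out at roughly 0.971 for Sunny and Rainy, 0 for Overcast, about 0.647 for the plain average and about 0.694 for the weighted one, in line with the rounded figures on the slide.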

Partitioning in Action

[Figure: four candidate partitionings of the data set, with E = .640, E = .789, E = .892 and E = .911]
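Choosing the split can be sketched as scoring every feature by the average entropy of the partitions it produces and picking the lowest score. The sketch below uses the row-weighted average, which is an assumption: the slide does not say which average each of its four E values uses, nor which feature each value belongs to, so the printed scores are only meant for comparison with the figure.

```python
from collections import Counter, defaultdict
from math import log2

ROWS = """
Sunny Hot High False No
Sunny Hot High True No
Overcast Hot High False Yes
Rainy Mild High False Yes
Rainy Cool Normal False Yes
Rainy Cool Normal True No
Overcast Cool Normal True Yes
Sunny Mild High False No
Sunny Cool Normal False Yes
Rainy Mild Normal False Yes
Sunny Mild Normal True Yes
Overcast Mild High True Yes
Overcast Hot Normal False Yes
Rainy Mild High True No
"""
FEATURES = ["Outlook", "Temperature", "Humidity", "Windy"]
TABLE = [line.split() for line in ROWS.strip().splitlines()]


def entropy(labels):
    n = len(labels)
    return sum(c / n * log2(n / c) for c in Counter(labels).values())


def weighted_entropy(column):
    """Row-weighted average entropy after splitting TABLE on one column."""
    parts = defaultdict(list)
    for row in TABLE:
        parts[row[column]].append(row[4])  # column 4 holds PlayTennis
    return sum(len(p) / len(TABLE) * entropy(p) for p in parts.values())


scores = {name: weighted_entropy(i) for i, name in enumerate(FEATURES)}
best = min(scores, key=scores.get)  # lowest remaining randomness wins

for name, score in sorted(scores.items(), key=lambda item: item[1]):
    print(name, round(score, 3))
print("split on:", best)
```

Under this scoring Outlook leaves the least randomness, which is consistent with the slides splitting on Outlook first.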

Recursive Partitioning

Based on material from the book: "Machine Learning", Tom M. Mitchell. McGraw-Hill, 1997.

Recursive Partitioning

Our data set: the 14 observations of the Play Tennis Example table (columns Outlook, Temperature, Humidity, Windy, PlayTennis).

Recursive Partitioning

The full data set is split on Outlook (row columns: Outlook, Temperature, Humidity, Windy, PlayTennis):

Outlook = Sunny
    Sunny     Hot   High    False  No
    Sunny     Hot   High    True   No
    Sunny     Mild  High    False  No
    Sunny     Cool  Normal  False  Yes
    Sunny     Mild  Normal  True   Yes

Outlook = Rainy
    Rainy     Mild  High    False  Yes
    Rainy     Cool  Normal  False  Yes
    Rainy     Cool  Normal  True   No
    Rainy     Mild  Normal  False  Yes
    Rainy     Mild  High    True   No

Outlook = Overcast
    Overcast  Hot   High    False  Yes
    Overcast  Cool  Normal  True   Yes
    Overcast  Mild  High    True   Yes
    Overcast  Hot   Normal  False  Yes

Recursive Partitioning

The Sunny partition is split again, on Humidity:

Outlook = Sunny
    Humidity = Normal
        Sunny     Cool  Normal  False  Yes
        Sunny     Mild  Normal  True   Yes
    Humidity = High
        Sunny     Hot   High    False  No
        Sunny     Hot   High    True   No
        Sunny     Mild  High    False  No

Outlook = Rainy
    Rainy     Mild  High    False  Yes
    Rainy     Cool  Normal  False  Yes
    Rainy     Cool  Normal  True   No
    Rainy     Mild  Normal  False  Yes
    Rainy     Mild  High    True   No

Outlook = Overcast
    Overcast  Hot   High    False  Yes
    Overcast  Cool  Normal  True   Yes
    Overcast  Mild  High    True   Yes
    Overcast  Hot   Normal  False  Yes

Recursive Partitioning

The Rainy partition is split on Windy. Every remaining partition now contains a single PlayTennis value:

Outlook = Sunny
    Humidity = Normal
        Sunny     Cool  Normal  False  Yes
        Sunny     Mild  Normal  True   Yes
    Humidity = High
        Sunny     Hot   High    False  No
        Sunny     Hot   High    True   No
        Sunny     Mild  High    False  No

Outlook = Rainy
    Windy = False
        Rainy     Mild  High    False  Yes
        Rainy     Cool  Normal  False  Yes
        Rainy     Mild  Normal  False  Yes
    Windy = True
        Rainy     Cool  Normal  True   No
        Rainy     Mild  High    True   No

Outlook = Overcast
    Overcast  Hot   High    False  Yes
    Overcast  Cool  Normal  True   Yes
    Overcast  Mild  High    True   Yes
    Overcast  Hot   Normal  False  Yes
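The whole procedure can be sketched as a short recursive function. This is an illustrative ID3-style sketch under stated assumptions, not the exact algorithm behind the slides: it scores splits by row-weighted average entropy, stops when a partition is pure, and falls back to the majority label if no features remain. All function names are made up for this example.

```python
from collections import Counter, defaultdict
from math import log2

ROWS = """
Sunny Hot High False No
Sunny Hot High True No
Overcast Hot High False Yes
Rainy Mild High False Yes
Rainy Cool Normal False Yes
Rainy Cool Normal True No
Overcast Cool Normal True Yes
Sunny Mild High False No
Sunny Cool Normal False Yes
Rainy Mild Normal False Yes
Sunny Mild Normal True Yes
Overcast Mild High True Yes
Overcast Hot Normal False Yes
Rainy Mild High True No
"""
FEATURES = ["Outlook", "Temperature", "Humidity", "Windy"]
TARGET = "PlayTennis"
TABLE = [dict(zip(FEATURES + [TARGET], line.split()))
         for line in ROWS.strip().splitlines()]


def entropy(labels):
    n = len(labels)
    return sum(c / n * log2(n / c) for c in Counter(labels).values())


def partition(rows, feature):
    """Group rows by their value for the given feature."""
    parts = defaultdict(list)
    for row in rows:
        parts[row[feature]].append(row)
    return parts


def weighted_entropy(rows, feature):
    return sum(len(p) / len(rows) * entropy([r[TARGET] for r in p])
               for p in partition(rows, feature).values())


def build(rows, features):
    """Return a leaf label or a nested {feature: {value: subtree}} node."""
    labels = [r[TARGET] for r in rows]
    if len(set(labels)) == 1:      # pure partition: stop with a leaf
        return labels[0]
    if not features:               # nothing left to split on: majority label
        return Counter(labels).most_common(1)[0][0]
    best = min(features, key=lambda f: weighted_entropy(rows, f))
    rest = [f for f in features if f != best]
    return {best: {value: build(part, rest)
                   for value, part in partition(rows, best).items()}}


tree = build(TABLE, FEATURES)
print(tree)
```

On this table the sketch picks Outlook at the root, Humidity for the Sunny partition and Windy for the Rainy partition, the same splits shown on the slides.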

Machine Learning in Python - Scikit-Learn

We will be using the Scikit-Learn module to build decision trees. Scikit-learn, or sklearn for short, provides all kinds of models:
- Neural networks
- Support vector machines
- Clustering algorithms
- Linear regression
- etc.

We will be using the treeviz module, a simple ASCII-based tree visualizer, to visualize decision trees.

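SKlearn Decision Tree Basics

Training data needs to be structured into a feature matrix and a target vector.

- The feature matrix has one row for each observation (rows run along axis 0, feature columns along axis 1).
- The target vector has one entry for each observation.

NOTE: rows and vector entries have to be consistent, i.e. row i of the feature matrix and entry i of the target vector must describe the same observation.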
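A minimal sketch of this layout for the Play Tennis data follows. Two things are assumptions made for the example: the hand-written integer codes in CODES (scikit-learn's trees expect numeric features, and other encodings are possible), and the guarded import, which only exists so the sketch still runs where scikit-learn is not installed. DecisionTreeClassifier with criterion="entropy" is scikit-learn's own class; it builds binary threshold splits, so the fitted tree need not look like the multiway tree on the slides.

```python
ROWS = """
Sunny Hot High False No
Sunny Hot High True No
Overcast Hot High False Yes
Rainy Mild High False Yes
Rainy Cool Normal False Yes
Rainy Cool Normal True No
Overcast Cool Normal True Yes
Sunny Mild High False No
Sunny Cool Normal False Yes
Rainy Mild Normal False Yes
Sunny Mild Normal True Yes
Overcast Mild High True Yes
Overcast Hot Normal False Yes
Rainy Mild High True No
"""
TABLE = [line.split() for line in ROWS.strip().splitlines()]

# Hypothetical integer codes for the categorical values (illustration only).
CODES = {"Sunny": 0, "Overcast": 1, "Rainy": 2,
         "Hot": 0, "Mild": 1, "Cool": 2,
         "High": 0, "Normal": 1,
         "False": 0, "True": 1}

# Feature matrix: one row per observation, one column per feature.
X = [[CODES[value] for value in row[:4]] for row in TABLE]
# Target vector: one entry per observation, in the same order as the rows of X.
y = [row[4] for row in TABLE]

try:
    from sklearn.tree import DecisionTreeClassifier
except ImportError:  # scikit-learn not installed: skip the fitting step
    DecisionTreeClassifier = None

if DecisionTreeClassifier is not None:
    model = DecisionTreeClassifier(criterion="entropy", random_state=0)
    model.fit(X, y)
    new_obs = [[CODES["Sunny"], CODES["Mild"], CODES["Normal"], CODES["False"]]]
    print(model.predict(new_obs))
```

Keeping X and y derived from the same TABLE list, in the same order, is what guarantees the consistency the slide asks for.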