What is the main difference between supervised and unsupervised learning?
Supervised learning uses labeled data, while unsupervised learning uses unlabeled data.
What is bias in data science?
Bias refers to systematic errors in data collection or analysis that can lead to incorrect conclusions.
1/123
p.1
Supervised Learning

What is the main difference between supervised and unsupervised learning?

Supervised learning uses labeled data, while unsupervised learning uses unlabeled data.

p.1
Bias in Data Science

What is bias in data science?

Bias refers to systematic errors in data collection or analysis that can lead to incorrect conclusions.

p.1
Model Evaluation Techniques

Why is model evaluation important in machine learning?

Model evaluation is crucial to assess the performance and accuracy of a model on unseen data.

p.2
Machine Learning Definition

What is Machine Learning (ML)?

A subset of artificial intelligence (AI) that enables systems to learn from data and improve performance over time without explicit programming.

p.2
Machine Learning Definition

How do self-learning algorithms in Machine Learning function?

They derive knowledge from data in order to make predictions.

p.14
Classification Techniques

How does a decision tree help in decision-making?

By figuring out the best choice through a series of questions.

p.3
Applications of Machine Learning

When is machine learning used?

When expertise is lacking, such as navigating on Mars.

p.3
Applications of Machine Learning

Why is machine learning useful in speech recognition?

Because human knowledge is difficult to articulate.

p.5
Supervised Learning

What is supervised learning?

A type of machine learning where the model is trained on labeled data, with each training example paired with an output label.

p.22
Regression Analysis

When should you use regression?

When changing X affects the outcome or when aiming to determine how X affects Y or predict Y.

p.2
Machine Learning Definition

What is the relationship between Machine Learning and Artificial Intelligence?

Machine Learning is a subfield of Artificial Intelligence.

p.27
Bias in Data Science

What is learning bias in machine learning?

Different ML algorithms encode different ways to generalize from a data set.

p.12
Regression Analysis

Give an example of linear regression.

Predicting how much a house will sell for based on its size.

p.22
Correlation Techniques

What does correlation measure?

The association or co-relationship between two variables.

p.13
Classification Techniques

When is logistic regression selected?

When the dependent variable is categorical with binary outputs.

p.31
Model Evaluation Techniques

What do proper evaluation techniques ensure for machine learning models?

They ensure models generalize well to unseen data.

p.28
Model Evaluation Techniques

Why is model evaluation important in machine learning?

Model evaluation is crucial to assess the performance and accuracy of a model on unseen data.

p.19
Unsupervised Learning

What does K-Means clustering help you find in your data?

Natural groupings based on similarities.

p.16
Classification Techniques

What is the purpose of the hyperplane in SVM?

To create maximal distance between two classes of data points.

p.21
Unsupervised Learning

What is a typical use case for Unsupervised Learning?

Customer segmentation.

p.12
Regression Analysis

What does linear regression identify?

Relationships between two things.

p.14
Applications of Machine Learning

What is an example application of decision trees?

Predicting the risk of heart attack.

p.28
Bias in Data Science

What is bias in data science?

Bias refers to systematic errors in data collection or analysis that can lead to incorrect conclusions.

p.27
Bias in Data Science

What does sampling bias describe?

How the process used to select a data set can introduce biases into later analysis.

p.31
Model Evaluation Techniques

What is the benefit of continuous evaluation and refinement of models?

It leads to better model performance.

p.10
Unsupervised Learning

What type of data does unsupervised learning deal with?

Unlabeled data or data of unknown structure.

p.26
Bias in Data Science

Is there a single best machine learning algorithm for all data sets?

No, there is no one best ML algorithm that outperforms all others across all possible data sets.

p.20
Unsupervised Learning

What does Market Basket Analysis help retailers figure out?

Which products are frequently bought together.

p.22
Correlation Techniques

When should you use correlation?

When swapping X and Y gives the same result or when analyzing if there is a relationship between X and Y.

p.10
Supervised Learning

What is supervised learning?

A type of machine learning where the model is trained on labeled data, with each training example paired with an output label.

p.7
Supervised Learning

What does the model do in Supervised Learning after being trained?

It identifies the data and provides the desired response, such as recognizing an apple.

p.13
Classification Techniques

What is a practical example of using logistic regression?

Predicting whether a person is happy or sad.

p.11
Supervised Learning

What are the two types of supervised learning mentioned?

Regression and Classification.

p.30
Model Evaluation Techniques

What is the role of the validation dataset?

To tune the model's hyperparameters and assess how well the model generalizes to unseen data.

p.11
Classification Techniques

What is the goal of classification in supervised learning?

To predict the categorical class labels of new instances based on past observations.

p.30
Model Evaluation Techniques

What does the test dataset evaluate?

The final performance of the model after training and validation, providing an unbiased assessment.

p.15
Applications of Machine Learning

What is an example application of Random Forest?

Predicting a fruit type based on factors like colour, size, and place of origin.

p.15
Supervised Learning

Can Random Forest be used for both regression and classification tasks?

Yes, it is flexible enough for both.

p.6
Supervised Learning

What is the main goal of supervised learning?

To predict outcomes based on input data using labeled examples.

p.6
Unsupervised Learning

Can unsupervised learning be used for clustering tasks?

Yes, it is often used for clustering tasks.

p.9
Pros and Cons of Unsupervised Learning

What aspect of data can unsupervised learning be sensitive to?

Outliers and noise in the data.

p.31
Model Evaluation Techniques

Why is model evaluation essential in machine learning?

It is essential for building reliable machine learning systems.

p.19
Unsupervised Learning

What is K-Means clustering?

A popular method used in unsupervised learning for grouping data into clusters.

p.3
Applications of Machine Learning

In what scenario is customization of models required?

In personalized medicine.

p.29
Model Evaluation Techniques

What is model evaluation in machine learning?

It assesses how well a machine learning model performs on unseen data.

p.15
Pros and Cons of Supervised Learning

What are the main benefits of using Random Forest?

It improves accuracy and reduces the chance of overfitting.

p.18
Unsupervised Learning

What is K-means?

A type of clustering method used in unsupervised learning.

p.4
Machine Learning Definition

What role does data play in machine learning?

Data is used to train models, allowing them to make predictions or decisions without explicit programming.

p.29
Model Evaluation Techniques

Why is generalization important in model evaluation?

It ensures that the model performs well on new, unseen data.

p.17
Unsupervised Learning

What type of data does unsupervised learning deal with?

Unlabeled data or data of unknown structure.

p.17
Unsupervised Learning

What is the main goal of unsupervised learning?

To discover hidden patterns in data without the need for human intervention.

p.8
Pros and Cons of Supervised Learning

What issue can arise if supervised learning models are not properly validated?

They are prone to overfitting.

p.9
Pros and Cons of Unsupervised Learning

What is a key application of unsupervised learning?

Useful for exploratory data analysis.

p.9
Pros and Cons of Unsupervised Learning

What is a challenge associated with interpreting results from unsupervised learning?

Results can be difficult to interpret without clear labels.

p.23
Correlation Analysis

What does a Pearson correlation measure?

The strength of a linear relationship between two numeric attributes.

p.21
Supervised Learning

What is the main goal of Supervised Learning?

To learn from labeled data to predict outcomes.

p.14
Classification Techniques

What is a decision tree?

A simple way to make decisions based on different criteria by asking a series of questions.

p.28
Supervised Learning

What is the main difference between supervised and unsupervised learning?

Supervised learning uses labeled data, while unsupervised learning uses unlabeled data.

p.7
Supervised Learning

What is the goal of Supervised Learning?

To train a model to identify specific categories, such as distinguishing between an apple and another fruit.

p.16
Supervised Learning

What is a support vector machine (SVM)?

A supervised learning model typically used for data classification problems.

p.15
Supervised Learning

What type of machine learning algorithm is Random Forest?

A supervised machine learning algorithm.

p.18
Unsupervised Learning

What is clustering in unsupervised learning?

An exploratory data analysis technique that organizes information into meaningful subgroups without knowing group memberships.

p.4
Machine Learning Definition

How does traditional programming approach problem-solving?

By using predefined rules and logic to process inputs and produce outputs.

p.3
Applications of Machine Learning

Why is machine learning important in genomics?

Because analysis relies on vast amounts of data.

p.10
Unsupervised Learning

What is the main goal of unsupervised learning?

To discover hidden patterns in data without the need for human intervention.

p.17
Supervised Learning

What is supervised learning?

A type of machine learning where the model is trained on labeled data, with each training example paired with an output label.

p.10
Supervised Learning

How does supervised learning differ from unsupervised learning?

Supervised learning uses labeled data, while unsupervised learning deals with unlabeled data.

p.11
Classification Techniques

What are some techniques used in classification?

Decision Trees, Support Vector Machines (SVM), and Random Forest.

p.6
Unsupervised Learning

What is unsupervised learning?

A type of machine learning where the model is trained on unlabeled data to find patterns.

p.6
Supervised Learning

Can supervised learning be used for classification tasks?

Yes, it is commonly used for classification tasks.

p.6
Supervised Learning

What type of data is required for supervised learning?

Labeled data.

p.21
Supervised Learning

What type of output does Supervised Learning produce?

Specific predictions or classifications.

p.20
Unsupervised Learning

What is Market Basket Analysis?

A technique used in retail to understand the purchasing behavior of customers.

p.13
Classification Techniques

What are examples of binary outputs in logistic regression?

'Yes' and 'No' or 'True' and 'False'.

p.11
Regression Analysis

What is regression analysis used for?

To predict continuous outcomes by finding relationships between variables.

p.30
Model Evaluation Techniques

What is the purpose of the training dataset in machine learning?

To train the machine learning model and allow it to learn from the data.

p.5
Unsupervised Learning

What type of data does unsupervised learning deal with?

Unlabeled data or data of unknown structure.

p.7
Unsupervised Learning

What is the main task of Unsupervised Learning?

To group similar items together, such as apples and pears that look alike.

p.16
Applications of Machine Learning

Give an example of a classification problem that can be solved using SVM.

Classifying emails as either 'spam' or 'not spam'.

p.7
Unsupervised Learning

How does the trained model in Unsupervised Learning categorize data?

By putting similar items into the same groups.

p.11
Regression Analysis

Name two methods used in regression analysis.

Linear regression and logistic regression.

p.4
Applications of Machine Learning

What is a key advantage of machine learning over traditional programming?

Machine learning can handle complex patterns and large datasets more effectively than traditional programming.

p.8
Pros and Cons of Supervised Learning

What is a key feature of supervised learning regarding algorithms?

A wide range of algorithms is available for different tasks.

p.23
Correlation Analysis

What does a correlation describe?

The strength of association between two attributes.

p.21
Unsupervised Learning

What is a common algorithm used in Unsupervised Learning?

K-Means Clustering.

p.21
Pros and Cons of Supervised Learning

When is Supervised Learning ideal to use?

When labeled data is available and specific predictions are needed.

p.25
Supervised Learning

What is the main difference between supervised and unsupervised learning?

Supervised learning uses labeled data, while unsupervised learning uses unlabeled data.

p.4
Machine Learning Definition

What is the main difference between machine learning and traditional programming?

In traditional programming, rules are explicitly coded, while in machine learning, the system learns from data.

p.25
Bias in Data Science

What is bias in data science?

Bias refers to systematic errors in data collection or analysis that can lead to incorrect conclusions.

p.19
Unsupervised Learning

Can you give an example of K-Means clustering?

Grouping different fruits based on features like species and color.

p.15
Supervised Learning

How does Random Forest achieve its results?

By combining multiple decision trees.

p.16
Classification Techniques

What features might be used to classify emails in SVM?

The presence of certain words or the sender.

p.18
Unsupervised Learning

What is market basket analysis?

An example of an association method used to find relationships between items in transactions.

p.8
Pros and Cons of Supervised Learning

What is another benefit of using supervised learning?

Clear interpretation of model results.

p.6
Unsupervised Learning

What is the main goal of unsupervised learning?

To discover hidden structures or patterns in data without predefined labels.

p.9
Pros and Cons of Unsupervised Learning

What is a challenge when using K-Means clustering in unsupervised learning?

Choosing the correct number of clusters (K) can be challenging.

p.21
Unsupervised Learning

What type of data does Unsupervised Learning use?

Unlabeled data to identify patterns.

p.25
Model Evaluation Techniques

Why is model evaluation important in machine learning?

Model evaluation helps determine the effectiveness and accuracy of a model in making predictions.

p.27
Bias in Data Science

How does a linear regression algorithm exhibit learning bias?

It encodes a linear generalization from the data and ignores nonlinear relationships.

p.18
Unsupervised Learning

What is association in unsupervised learning?

An unsupervised learning method that finds relationships between variables in a dataset.

p.27
Bias in Data Science

What factors contribute most to bias in machine learning?

The dataset the algorithm runs on and the choice of algorithm.

p.8
Pros and Cons of Supervised Learning

What is a major advantage of supervised learning?

High accuracy with sufficient labeled data.

p.8
Pros and Cons of Supervised Learning

What is a significant drawback of supervised learning?

It requires large amounts of labeled data, which can be time-consuming and expensive to obtain.

p.9
Pros and Cons of Unsupervised Learning

How can unsupervised learning be beneficial in data analysis?

It can uncover hidden patterns and insights in data.

p.6
Unsupervised Learning

What type of data is required for unsupervised learning?

Unlabeled data.

p.21
Supervised Learning

What is a typical use case for Supervised Learning?

Predicting house prices.

p.21
Unsupervised Learning

What type of output does Unsupervised Learning produce?

Groups data points or identifies structure without specific outputs.

p.29
Model Evaluation Techniques

What is the purpose of model evaluation?

To ensure generalization to new data, identify strengths and weaknesses of the model, and guide model selection and improvement.

p.5
Supervised Learning

How does supervised learning differ from unsupervised learning?

Supervised learning uses labeled data, while unsupervised learning deals with unlabeled data.

p.30
Model Evaluation Techniques

How should a dataset be divided for effective machine learning?

Into three parts: training, validation, and test datasets.

p.17
Supervised Learning

How does supervised learning differ from unsupervised learning?

Supervised learning uses labeled data, while unsupervised learning deals with unlabeled data.

p.9
Pros and Cons of Unsupervised Learning

What is a major advantage of unsupervised learning regarding data preparation?

No need for labeled data, simplifying data preparation.

p.23
Correlation Analysis

What is the range of values for a Pearson correlation?

From -1 to +1.

p.21
Supervised Learning

What is a common algorithm used in Supervised Learning?

Linear Regression.

p.5
Unsupervised Learning

What is the main goal of unsupervised learning?

To discover hidden patterns in data without the need for human intervention.

p.22
Regression Analysis

What does regression describe?

How to numerically relate an independent variable to a dependent variable.

p.4
Machine Learning Definition

Can machine learning adapt to new data?

Yes, machine learning models can improve and adapt as they are exposed to more data.

p.29
Model Evaluation Techniques

How does model evaluation help in model selection?

It identifies the strengths and weaknesses of different models, guiding the selection process.

p.6
Supervised Learning

What is supervised learning?

A type of machine learning where the model is trained on labeled data.

p.8
Pros and Cons of Supervised Learning

What challenge does supervised learning face with certain datasets?

It may struggle with imbalanced datasets.

p.23
Correlation Analysis

What letter is used to denote the Pearson correlation coefficient?

The letter r.

p.21
Supervised Learning

What is required for Supervised Learning?

A dataset with input-output pairs (features and labels).

p.21
Pros and Cons of Unsupervised Learning

When is Unsupervised Learning useful?

For exploring data and discovering hidden patterns without predefined labels.

Study Smarter, Not Harder
Study Smarter, Not Harder