<?xml version="1.0" encoding="utf-8" standalone="yes"?><rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>IBM-ML on Ghafoor's Personal Blog</title><link>http://ghafoorsblog.com/series/ibm-ml/</link><description>Recent content in IBM-ML on Ghafoor's Personal Blog</description><generator>Hugo</generator><language>en</language><managingEditor>noreply@example.com (AG Sayyed)</managingEditor><webMaster>noreply@example.com (AG Sayyed)</webMaster><copyright>Copyright © 2024-2026 AG Sayyed. All Rights Reserved.</copyright><lastBuildDate>Thu, 16 Apr 2026 17:23:48 +0100</lastBuildDate><atom:link href="http://ghafoorsblog.com/series/ibm-ml/index.xml" rel="self" type="application/rss+xml"/><item><title>Outliers and Missing Values</title><link>http://ghafoorsblog.com/courses/ibm/ml-content/ml-pcert/01-data-analysis-for-ml/02-module/004-outliers-part-1/</link><pubDate>Mon, 31 Mar 2025 14:11:35 +0000</pubDate><author>noreply@example.com (AG Sayyed)</author><guid>http://ghafoorsblog.com/courses/ibm/ml-content/ml-pcert/01-data-analysis-for-ml/02-module/004-outliers-part-1/</guid><description>&lt;p class="lead text-primary"&gt;
This document provides a comprehensive guide on handling missing values and outliers in datasets, including techniques for detection, imputation, and removal, along with practical Python code examples. It also discusses the impact of these issues on machine learning models and offers strategies for effective data preprocessing.
&lt;/p&gt;


&lt;hr&gt;
&lt;h2 id="understanding-data-quality-issues"&gt;Understanding Data Quality Issues&lt;/h2&gt;
&lt;p&gt;Before diving into specific techniques, it&amp;rsquo;s important to understand why missing values and outliers matter:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;&lt;strong&gt;Impact on Model Performance&lt;/strong&gt;: These issues can significantly reduce model accuracy and reliability&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Bias Introduction&lt;/strong&gt;: Improper handling can lead to biased models that don&amp;rsquo;t generalize well&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Data Integrity&lt;/strong&gt;: They often signal problems in data collection or processing that need addressing&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;A systematic approach to handling these issues is essential for building robust machine learning models.&lt;/p&gt;</description></item><item><title>Data Cleaning</title><link>http://ghafoorsblog.com/courses/ibm/ml-content/ml-pcert/01-data-analysis-for-ml/02-module/003-data-cleaning/</link><pubDate>Mon, 31 Mar 2025 13:57:30 +0000</pubDate><author>noreply@example.com (AG Sayyed)</author><guid>http://ghafoorsblog.com/courses/ibm/ml-content/ml-pcert/01-data-analysis-for-ml/02-module/003-data-cleaning/</guid><description>&lt;p class="lead text-primary"&gt;
This document explains the importance of data cleaning in machine learning, common issues with messy data, and methods for handling duplicate data to ensure reliable model outcomes.
&lt;/p&gt;


&lt;hr&gt;
&lt;h2 id="importance-of-data-cleaning"&gt;Importance of Data Cleaning&lt;/h2&gt;
&lt;p&gt;Data cleaning is a critical step in the machine learning workflow. Models rely on accurate and clean data to produce reliable outcomes. Messy data can misrepresent relationships between features and targets, leading to the &amp;ldquo;garbage-in, garbage-out&amp;rdquo; effect. Key aspects affected by messy data include:&lt;/p&gt;</description></item><item><title>Retrieving Data from SQL and NoSQL Databases, APIs, and Cloud Data Sources</title><link>http://ghafoorsblog.com/courses/ibm/ml-content/ml-pcert/01-data-analysis-for-ml/02-module/002-retrieving-data-part-2/</link><pubDate>Sun, 30 Mar 2025 20:47:51 +0000</pubDate><author>noreply@example.com (AG Sayyed)</author><guid>http://ghafoorsblog.com/courses/ibm/ml-content/ml-pcert/01-data-analysis-for-ml/02-module/002-retrieving-data-part-2/</guid><description>&lt;p class="lead text-primary"&gt;
This document explains methods for retrieving data from SQL and NoSQL databases, APIs, and Cloud data sources, highlighting practical considerations and Python code examples for seamless data integration.
&lt;/p&gt;


&lt;hr&gt;
&lt;h2 id="retrieving-data-from-different-sources"&gt;Retrieving Data from Different Sources&lt;/h2&gt;
&lt;ol&gt;
&lt;li&gt;&lt;strong&gt;SQL Databases&lt;/strong&gt;: Structured Query Language databases are relational databases with fixed schemas. They are widely used for data storage and retrieval.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;NoSQL Databases&lt;/strong&gt;: Non-relational databases that offer flexibility in data storage and retrieval. They are often faster and more scalable than SQL databases.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;APIs&lt;/strong&gt;: Application Programming Interfaces allow access to data from various providers, enabling seamless integration with external data sources.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Cloud Data Sources&lt;/strong&gt;: Cloud platforms provide data storage and retrieval services, allowing users to access data from anywhere with an internet connection.&lt;/li&gt;
&lt;/ol&gt;
&lt;h2 id="working-with-sql-databases"&gt;Working with SQL Databases&lt;/h2&gt;
&lt;p&gt;SQL (Structured Query Language) databases are relational databases with fixed schemas. Examples include Microsoft SQL Server, Postgres, MySQL, AWS Redshift, Oracle DB, and IBM Db2. Python libraries such as &lt;code&gt;sqlite3&lt;/code&gt;, &lt;code&gt;SQLAlchemy&lt;/code&gt;, &lt;code&gt;Psycopg2&lt;/code&gt; (for Postgres), and &lt;code&gt;ibm_db&lt;/code&gt; (for Db2) can be used to connect to these databases.&lt;/p&gt;</description></item><item><title>Retrieving Data from CSV and JSON Files</title><link>http://ghafoorsblog.com/courses/ibm/ml-content/ml-pcert/01-data-analysis-for-ml/02-module/001-retrieving-data-part-1/</link><pubDate>Sun, 30 Mar 2025 20:43:48 +0000</pubDate><author>noreply@example.com (AG Sayyed)</author><guid>http://ghafoorsblog.com/courses/ibm/ml-content/ml-pcert/01-data-analysis-for-ml/02-module/001-retrieving-data-part-1/</guid><description>&lt;p class="lead text-primary"&gt;
This document explains methods for retrieving data from various sources, including CSV and JSON files, and highlights practical considerations when working with these formats using Python and Pandas.
&lt;/p&gt;


&lt;hr&gt;
&lt;h2 id="retrieving-data-from-different-sources"&gt;Retrieving Data from Different Sources&lt;/h2&gt;
&lt;ol&gt;
&lt;li&gt;&lt;strong&gt;CSV Files&lt;/strong&gt;: Comma Separated Values files are widely used for storing tabular data. They can be easily read into Pandas DataFrames.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;JSON Files&lt;/strong&gt;: JavaScript Object Notation files are commonly used for structured data storage. They can also be read into Pandas DataFrames.&lt;/li&gt;
&lt;/ol&gt;
&lt;h3 id="downloading-data-files"&gt;Downloading Data Files&lt;/h3&gt;
&lt;p&gt;For this exercise, the Iris dataset is used which contains information about different species of iris flowers. The dataset is available in both CSV and JSON formats. Download the files from the following links:&lt;/p&gt;</description></item><item><title>Machine Learning Workflow</title><link>http://ghafoorsblog.com/courses/ibm/ml-content/ml-pcert/01-data-analysis-for-ml/01-module/005-ml-workflow/</link><pubDate>Sun, 30 Mar 2025 16:07:22 +0000</pubDate><author>noreply@example.com (AG Sayyed)</author><guid>http://ghafoorsblog.com/courses/ibm/ml-content/ml-pcert/01-data-analysis-for-ml/01-module/005-ml-workflow/</guid><description>&lt;p class="lead text-primary"&gt;
This document explains the foundational concepts, workflow, and vocabulary of machine learning, providing a clear understanding of the tools and processes involved in building and deploying machine learning models.
&lt;/p&gt;


&lt;hr&gt;
&lt;hr&gt;
&lt;h2 id="1-machine-learning-workflow"&gt;1. Machine Learning Workflow&lt;/h2&gt;
&lt;p&gt;The machine learning workflow is a structured approach to developing and deploying machine learning models. It consists of several key steps that guide practitioners from problem definition to model deployment. The following table outlines the main steps in the workflow:&lt;/p&gt;</description></item><item><title>Modern AI Applications and ML Workflow</title><link>http://ghafoorsblog.com/courses/ibm/ml-content/ml-pcert/01-data-analysis-for-ml/01-module/004-modern-ai/</link><pubDate>Sun, 30 Mar 2025 15:38:37 +0000</pubDate><author>noreply@example.com (AG Sayyed)</author><guid>http://ghafoorsblog.com/courses/ibm/ml-content/ml-pcert/01-data-analysis-for-ml/01-module/004-modern-ai/</guid><description>&lt;p class="lead text-primary"&gt;
This document explains the current advancements in artificial intelligence (AI), highlighting its transformative impact across industries, the factors driving this era of innovation, and the practical applications shaping everyday life.
&lt;/p&gt;


&lt;hr&gt;
&lt;h2 id="1-the-current-ai-landscape"&gt;1. The Current AI Landscape&lt;/h2&gt;
&lt;p&gt;Artificial intelligence is experiencing a transformative era, driven by advancements in computer vision and natural language processing (NLP). These technologies are reshaping industries and enhancing everyday life through practical applications.&lt;/p&gt;
&lt;h3 id="11-key-areas-of-growth"&gt;1.1. Key Areas of Growth&lt;/h3&gt;
&lt;ul&gt;
&lt;li&gt;&lt;strong&gt;Computer Vision&lt;/strong&gt;: Significant progress has been made in areas such as autonomous vehicles and medical imaging. AI systems are now capable of diagnosing illnesses from x-rays and MRIs with accuracy comparable to, or better than, medical professionals.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Natural Language Processing&lt;/strong&gt;: NLP has advanced in tasks like translation, sentiment analysis, article clustering, and automated writing, enabling a wide range of applications.&lt;/li&gt;
&lt;/ul&gt;
&lt;h3 id="12-factors-driving-this-era-of-ai"&gt;1.2. Factors Driving This Era of AI&lt;/h3&gt;
&lt;ul&gt;
&lt;li&gt;&lt;strong&gt;Larger Datasets&lt;/strong&gt;: The availability of diverse and extensive datasets, supported by cloud infrastructure, has enabled AI models to learn complex patterns across various domains.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Faster Computing&lt;/strong&gt;: Modern hardware offers powerful processing capabilities at a fraction of the cost of earlier systems, making AI more accessible and efficient.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Advances in Neural Networks&lt;/strong&gt;: Innovations in deep learning have led to practical results, enabling AI to excel in complex tasks such as facial recognition, voice commands, and image classification.&lt;/li&gt;
&lt;/ul&gt;
&lt;h3 id="13-everyday-applications"&gt;1.3. Everyday Applications&lt;/h3&gt;
&lt;p&gt;AI is now integrated into daily life through technologies like facial recognition for unlocking phones, voice-activated home automation, cashier-less stores, and personalized recommendations.&lt;/p&gt;</description></item><item><title>History of AI</title><link>http://ghafoorsblog.com/courses/ibm/ml-content/ml-pcert/01-data-analysis-for-ml/01-module/003-history-of-ai/</link><pubDate>Sun, 30 Mar 2025 14:18:02 +0000</pubDate><author>noreply@example.com (AG Sayyed)</author><guid>http://ghafoorsblog.com/courses/ibm/ml-content/ml-pcert/01-data-analysis-for-ml/01-module/003-history-of-ai/</guid><description>&lt;p class="lead text-primary"&gt;
This document explains the historical development of artificial intelligence (AI), highlighting its cycles of progress and setbacks, key milestones, and the evolution of technologies that have shaped the field.
&lt;/p&gt;


&lt;hr&gt;
&lt;h2 id="1-the-history-of-artificial-intelligence"&gt;1. The History of Artificial Intelligence&lt;/h2&gt;
&lt;p&gt;Artificial intelligence has experienced several cycles of significant advancements and setbacks, often referred to as &amp;ldquo;AI Winters.&amp;rdquo; These cycles have been marked by periods of excitement and investment, followed by disillusionment and reduced funding. The following sections outline the key milestones in AI&amp;rsquo;s history.&lt;/p&gt;</description></item><item><title>Machine Learning and Deep learning</title><link>http://ghafoorsblog.com/courses/ibm/ml-content/ml-pcert/01-data-analysis-for-ml/01-module/002-ml-deep-learning/</link><pubDate>Wed, 20 Nov 2024 03:33:25 +0000</pubDate><author>noreply@example.com (AG Sayyed)</author><guid>http://ghafoorsblog.com/courses/ibm/ml-content/ml-pcert/01-data-analysis-for-ml/01-module/002-ml-deep-learning/</guid><description>&lt;p class="lead text-primary"&gt;
This document provides an overview of Machine Learning (ML) and Deep Learning (DL), including their definitions, differences, and applications. It also discusses the importance of features and targets in ML, as well as the challenges associated with image data. The document concludes with a brief introduction to the history of AI and its evolution into the current state of technology.
&lt;/p&gt;


&lt;hr&gt;
&lt;h2 id="1-introduction-to-machine-learning"&gt;1. Introduction to Machine Learning&lt;/h2&gt;
&lt;p&gt;Machine Learning (ML) is a branch of Artificial Intelligence (AI) focused on developing algorithms that learn from data over time, rather than being explicitly programmed. By analysing data, ML algorithms identify patterns and improve their performance as more data becomes available. However, after a certain point, the performance gains from additional data diminish, leading to a plateau. Additionally, ML algorithms can be categorized into different types based on their learning approach, such as supervised, unsupervised, and reinforcement learning.&lt;/p&gt;</description></item><item><title>AI and Machine Learning Introduction</title><link>http://ghafoorsblog.com/courses/ibm/ml-content/ml-pcert/01-data-analysis-for-ml/01-module/001-introduction/</link><pubDate>Wed, 20 Nov 2024 02:33:25 +0000</pubDate><author>noreply@example.com (AG Sayyed)</author><guid>http://ghafoorsblog.com/courses/ibm/ml-content/ml-pcert/01-data-analysis-for-ml/01-module/001-introduction/</guid><description>&lt;p class="lead text-primary"&gt;
This document introduces Artificial Intelligence (AI) and Machine Learning (ML), explaining their definitions, history, real-world applications, and the differences between AI, ML, and Deep Learning. It also highlights the importance of understanding features and targets in ML.
&lt;/p&gt;


&lt;hr&gt;

&lt;blockquote class="alert alert-info" role="alert"&gt;
 &lt;p class="alert-heading fw-bold"&gt;
 &lt;svg aria-hidden="true" class="bi bi-info-circle hi-svg-inline me-1 me-lg-2" fill="currentColor" height="1em" viewBox="0 0 16 16" width="1em" xmlns="http://www.w3.org/2000/svg"&gt;
 &lt;path d="M8 15A7 7 0 1 1 8 1a7 7 0 0 1 0 14m0 1A8 8 0 1 0 8 0a8 8 0 0 0 0 16"/&gt;
 &lt;path d="m8.93 6.588-2.29.287-.082.38.45.083c.294.07.352.176.288.469l-.738 3.468c-.194.897.105 1.319.808 1.319.545 0 1.178-.252 1.465-.598l.088-.416c-.2.176-.492.246-.686.246-.275 0-.375-.193-.304-.533zM9 4.5a1 1 0 1 1-2 0 1 1 0 0 1 2 0"/&gt;
&lt;/svg&gt;Prerequisites
 &lt;/p&gt;</description></item></channel></rss>