# Fuzzy set approaches

Data classification models are a very popular form of data analysis. Common classification methods include tree induction, Bayesian classification and belief networks, and neural networks. This tip, from Han and Kamber's book Data Mining: Concepts and Techniques (Morgan Kaufmann), discusses a less popular method, but one that is growing in appeal: fuzzy logic.

Rule-based systems for classification have the disadvantage that they involve sharp cutoffs for continuous attributes. For example, consider the following rule for customer credit application approval. The rule essentially says that applications from customers who have held a job for two or more years and who have a high income (i.e., of at least \$50K) are approved:

IF (years_employed >= 2) ^ (income >= 50K) THEN credit = "approved"

With this rule, a customer who has had a job for at least two years will receive credit if her income is, say, \$50K, but not if it is \$49K. Such harsh thresholding may seem unfair. Instead, fuzzy logic can be introduced into the system to allow "fuzzy" thresholds or boundaries to be defined. Rather than having a precise cutoff between categories or sets, fuzzy logic uses truth values between 0.0 and 1.0 to represent the degree of membership that a certain value has in a given category. Hence, with fuzzy logic, we can capture the notion that an income of \$49K is, to some degree, high, although not as high as an income of \$50K.
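To make the contrast concrete, here is a minimal Python sketch that compares the sharp cutoff with one possible fuzzy membership function for "high income". The linear ramp and its \$30K lower bound are assumptions chosen for illustration; the source only states that truth values lie between 0.0 and 1.0, and the function names are hypothetical.

```python
def crisp_high_income(income, cutoff=50_000):
    """Sharp threshold: income is either high (1.0) or not (0.0)."""
    return 1.0 if income >= cutoff else 0.0


def high_income_membership(income, lower=30_000, upper=50_000):
    """Fuzzy threshold: degree (0.0 to 1.0) to which income is "high".

    Assumption: membership rises linearly from `lower` to `upper`.
    Both breakpoints are hypothetical, not taken from the source.
    """
    if income <= lower:
        return 0.0
    if income >= upper:
        return 1.0
    return (income - lower) / (upper - lower)


if __name__ == "__main__":
    for income in (29_000, 40_000, 49_000, 50_000):
        print(income, crisp_high_income(income), high_income_membership(income))
```

Under these assumed breakpoints, an income of \$49K is "high" to a degree close to 1.0 rather than being rejected outright, while \$50K remains fully "high".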

Fuzzy logic is useful for data mining systems performing classification. It provides the advantage of working at a high level of abstraction. In general, the use of fuzzy logic in rule-based systems involves the following:

• Attribute values are converted to fuzzy values. For example, values for the continuous attribute income are mapped into the discrete categories {low, medium, high}, and the fuzzy membership or truth values are calculated. Fuzzy logic systems typically provide graphical tools to assist users in this step.
• For a given new sample, more than one fuzzy rule may apply. Each applicable rule contributes a vote for membership in the categories. Typically, the truth values for each predicted category are summed.
• The sums obtained above are combined into a value that is returned by the system. This process may be done by weighting each category by its truth sum and multiplying by the mean truth value of each category. The calculations involved may be more complex, depending on the complexity of the fuzzy membership graphs.
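The three steps above can be sketched in Python as follows. This is an illustration under stated assumptions, not a definitive implementation: the membership breakpoints, the rule set, the use of `min` for the fuzzy AND, and the final choice of the highest-scoring category are all hypothetical. The score (truth sum times mean truth value) is only one reading of the combination step described above.

```python
from collections import defaultdict


def ramp_up(x, a, b):
    """Return 0.0 below a, 1.0 above b, and a linear value in between."""
    if x <= a:
        return 0.0
    if x >= b:
        return 1.0
    return (x - a) / (b - a)


def fuzzify(income, years_employed):
    """Step 1: convert attribute values to fuzzy truth values.

    All breakpoints here are hypothetical.
    """
    high = ramp_up(income, 30_000, 50_000)
    low = 1.0 - ramp_up(income, 10_000, 30_000)
    medium = max(0.0, 1.0 - high - low)
    long_job = ramp_up(years_employed, 0, 2)
    return {"low": low, "medium": medium, "high": high,
            "long": long_job, "short": 1.0 - long_job}


# Hypothetical rule base: (fuzzy terms joined by AND, predicted category).
RULES = [
    (("long", "high"), "approved"),
    (("long", "medium"), "approved"),
    (("short",), "rejected"),
    (("low",), "rejected"),
]


def classify(income, years_employed):
    truth = fuzzify(income, years_employed)

    # Step 2: every applicable rule votes for its category.
    # Assumption: fuzzy AND is the minimum of the term truth values.
    votes = defaultdict(list)
    for terms, category in RULES:
        strength = min(truth[t] for t in terms)
        if strength > 0.0:
            votes[category].append(strength)

    # Step 3: combine the votes into one score per category
    # (truth sum multiplied by mean truth value), then pick the best.
    scores = {}
    for category, strengths in votes.items():
        truth_sum = sum(strengths)
        mean_truth = truth_sum / len(strengths)
        scores[category] = truth_sum * mean_truth
    if not scores:
        return None, scores
    return max(scores, key=scores.get), scores


if __name__ == "__main__":
    print(classify(49_000, 3))
    print(classify(15_000, 0.5))
```

With these assumed rules, an applicant with three years of employment and an income of \$49K is still classified as approved, because the "high" and "medium" memberships both contribute votes instead of the application failing a sharp \$50K test.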

Fuzzy logic systems have been used in numerous areas for classification, including health care and finance.

This was first published in May 2001
