Хелпикс

Главная

Контакты

Случайная статья





Task 1. Mark the following statements as True or False.



Task 1. Mark the following statements as True or False.

1) Desktop organizers are programs that require desktop computers.

2) Computers are sometimes used to monitor systems that previously needed

human supervision.

3) Networking is a way of allowing otherwise incompatible systems to

communicate and share resources.

4) The use of computers prevents people from being creative.

5) Computer users do not have much influence over the way that computing

develops.

 

DATA MINING

Data mining is simply filtering through large amounts of raw data for useful

information that gives businesses a competitive edge. This information is made up of meaningful patterns and trends that are already in the data but were previously

unseen.

The most popular tool used when mining is artificial intelligence (AI). AI technologies try to work the way the human brain works, by making 10 intelligent

guesses, learning by example, and using deductive reasoning. Some of the more

popular AI methods used in data mining include neural networks, clustering, and

decision trees.

Neural networks look at the rules of using data, 15 which are based on the connections found or on a sample set of data. As a result, the software continually

analyses value and compares it to the other factors, and it compares these factors

repeatedly until it finds patterns emerging. These 20 patterns are known as rules. The software then looks for other patterns based on these rules or sends out an alarm when a trigger value is hit.

Clustering divides data into groups based on similar features or limited data

ranges. Clusters are used when data isn't labelled in a way that is favourable to

mining. For instance, an insurance company that wants to find instances of fraud

wouldn't have its records labelled as fraudulent or not fraudulent. But after analysing patterns within clusters, the mining software can start to figure out the rules that point to which claims are likely to be false.

Decision trees, like clusters, separate the data into subsets and then analyse the

subsets to divide them into further subsets, and so on (for a few more levels). The

final subsets are then small enough that the mining process can find interesting

patterns and relationships within the data.

Once the data to be mined is identified, it should be cleansed. Cleansing data

frees it from duplicate information and erroneous data. Next, the data should be

stored in a uniform format within relevant categories or fields. Mining tools can

work with all types of data storage, from large data warehouses to smaller desktop

databases to flat files. Data warehouses and data marts are storage methods that

involve archiving large amounts of data in a way that makes it easy so to access when necessary.

When the process is complete, the mining software generates a report. An

analyst goes over the report to see if further work needs to be done, such as refining parameters, using other data analysis tools to examine the data, or even scrapping the data if it's unusable. If no further work is required, the report proceeds to the decision makers for appropriate action.

The power of data mining is being used for many purposes, such as analysing

Supreme Court decisions, discovering patterns in health care, pulling stories about

competitors from newswires, resolving bottlenecks in production processes, and

analysing sequences in the human genetic makeup. There really is no limit to the type of business or area of study where data mining can be beneficial.

 

Task 1 . Mark the following as True or False:

1) Data mining is a process of analysing known patterns in data,

2) Artificial intelligence is commonly used in data mining,

3) In data mining, patterns found while analyzing data are used for further

analysing the data,

4) Data mining is used to detect false insurance claims,

5) Data mining is only useful for a limited range of problems.

 



  

© helpiks.su При использовании или копировании материалов прямая ссылка на сайт обязательна.