Download as doc, pdf, or txt
Download as doc, pdf, or txt
You are on page 1of 17

Data Mining Tool

Data mining basically means pulling out important information from huge volume of data. Data mining tools are used for the purposes of examining the data from various viewpoints and summarizing it into a useful database library. However, lately Data mining tools have become computer based applications in order to handle the growing amount of data. Data mining tools are also sometimes referred to as knowledge discovery tools. As a concept, data mining has always existed since the past and manual processes were used as data mining tools. Later with the advent of fast

processing computers, analytical software tools, and increased storage capacities automated data mining tools were developed. These data mining tools drastically improved the accuracy of analysis, data mining speed, and also brought down the costs of operation. Data mining tools essentially facilitate following major elements:

Data Warehouse
Data Warehouse And Its Applications In Agriculture Based On Rajasthan State

Mr Felix Deepak Minj [HOD Dept. of IT, Shekhawati Group of Institutions, Sikar] Introduction A Data warehouse is a repository of integrated information, available for queries and analysis. Data and information are extracted from heterogeneous sources as they are generated. This makes it much easier and more efficient to run queries over data that originally came from different sources. In other words Data warehouse is a database that is used to hold data for reporting and analysis.

A data warehouse is a single, complete and consistent data archive, extracted from different sources and made available to end-users in a form understandable and usable to them in the context of the business. A data warehouse consists of a set of subjectoriented, integrated, permanent, timedependent data providing support to managerial decision-making. Economic foundation and productivity growth depends on agricultural sectors. Agriculture is the driving force behind the way of live and source of earnings for the majority of peoples. More than 60 percents of population are living in rural areas and the majority are farmers.

The rural communities as a main producer for country food productivity and food security earn only 11 percents of Gross Domestic Product (GDP). The arrival of information age guides this country to new development strategies. National Electronics and Computer Technology Center (NECTEC) in collaboration with the Ministry of Agriculture, has launched "Agriculture Information Network" as a response to the unmet information requirements of the agricultural sector. Farmers should gain benefit from the contents provided which include risk assessment, agriculture warning system and agricultural knowledge base, which aim

to improve technology, productivity, income and stability of India agriculture sector through the age of Information Technology. The data warehouse consists of common databases and geospatial databases from various departments and organizations in the country and abroad. Farmers can get access to the contents through Internet by themselves or from groups of professional people called "Information Brokers". Abstract: First step towards understanding any agricultural system is the comprehension of relationships between

the system and numerous physical, chemical and biological factors influencing it. Any decision regarding such systems requires analytical exploration of the involved data. The exploration task is to be supported by an efficient data storage and retrieval mechanism. In this paper we have presented the case of an Agri data warehouse for this purpose. We have briefly discussed the process we adopted for establishing the data warehouse encompassing pest, pesticide and metrological data. We have also shown how implementing an OLAP tool on top of the Agri data warehouse resulted in interesting findings from a decision support point of view.

Methodology The information system will consist of several integrated sub-systems for input, storage, retrieval, analysis and output based on strong database design with its essential functions. Besides this it will include other functions such as manipulation and dissemination of information to various users. The information system, composed of set of files for use in a RDBMS and GIS will be capable of delivering accurate, useful and timely information to various applications. Design of spatial and nonspatial database will have specifications of different data fields, their logical

array and inter-relationship with subsystem database. Data warehouse Data warehouse is a repository of an organization's electronically stored data. Data warehouses are designed to facilitate reporting and analysis. This definition of the data warehouse focuses on data storage. However, the means to retrieve and analyze data, to extract, transform and load data, and to manage the data dictionary are also considered essential components of a data warehousing system. Many references to data warehousing use this broader context. Thus, an expanded definition

for data warehousing includes business intelligence tools, tools to extract, transform, and load data into the repository, and tools to manage and retrieve metadata. A data warehouse is used for answer any ad_hoc, complex, statistical or analytical queries. Data warehouse is situated at the center of a decision support system (DSS) of an organization. Data warehouse stores integrated historical data both summarized and detailed information for organization
Benefits of data warehousing

Some of the benefits that a data warehouse provides are as follows: A data warehouse provides a common data model for all data of

interest regardless of the data's source. This makes it easier to report and analyze information than it would be if multiple data models were used to retrieve information such as sales invoices, order receipts, general ledger charges, etc. Prior to loading data into the data warehouse, inconsistencies are identified and resolved. This greatly simplifies reporting and analysis. Information in the data warehouse is under the control of data warehouse

users so that, even if the source system data is purged over time, the information in the warehouse can be stored safely for extended periods of time. Because they are separate from operational systems, data warehouses provide retrieval of data without slowing down operational systems. Data warehouses can work in conjunction with and, hence, enhance the value of operational business applications, notably customer relationship management (CRM) systems. Data warehouses facilitate decision support system applications such as trend reports (e.g., the items with the

most sales in a particular area within the last two years), exception reports, and reports that show actual performance versus goals. Data mart A data mart is a subset of an organizational data store, usually oriented to a specific purpose or major data subject, that may be distributed to support business needs. Data marts are analytical data stores designed to focus on specific business functions for a specific community within an organization. Data marts are often derived from subsets of data in a data warehouse, though in the bottom-up data warehouse design methodology the

data warehouse is created from the union of organizational data marts. A data mart is a data repository that may or may not derive from a data warehouse and that emphasizes ease of access and usability for a particular designed purpose. In general, a data warehouse tends to be a strategic but somewhat unfinished concept; a data mart tends to be tactical and aimed at meeting an immediate need. There can be multiple data marts inside a single corporation; each one relevant to one or more business units for which it was designed. Data marts may or may not be dependent or related to other

data marts in a single corporation. If the data marts are designed using conformed facts and dimensions, then they will be related. In some deployments, each department or business unit is considered the owner of its data mart including all the hardware, software and data. This enables each department to use, manipulate and develop their data any way they see fit; without altering information inside other data marts or the data warehouse. In other deployments where conformed dimensions are used, this business unit ownership will not hold true for shared dimensions like customer, product, etc.

Reasons for creating a data mart

Easy access to frequently needed data Creates collective view by a group of users Improves end-user response time Ease of creation Lower cost than implementing a full Data warehouse Potential users are more clearly defined than in a full Data warehouse

OLAP: OLAP allows business users to slice and dice data at will. Normally data in an organization is distributed in multiple data sources and are incompatible with

each other. A retail example: Point-ofsales data and sales made via call-center or the Web are stored in different location and formats. It would a time consuming process for an executive to obtain OLAP reports such as - What are the most popular products purchased by customers between the ages 15 to 30?

You might also like