RapidMiner 4.3 extends functionality and speeds up data analysis processes

http://www.crmmanager.net/magazine/news_h34017_rapidminer_43_extends_functionality_and_speeds.html

With more than 500 basic process building blocks (operators) for data extraction, transformation, and loading (ETL) from all kinds of data sources, on-line analytical processing (OLAP), data mining, text and web mining, time series analysis and forecasting, predictive analytics, business intelligence (BI), and reporting, the open source software RapidMiner is one of the most comprehensive solutions for a wide range of data analysis, forecasting, and decision support tasks. In response to demand from the large RapidMiner user base in more than 40 countries, Rapid-I has added more than 50 new operators to the new release, RapidMiner 4.3. They cover all steps of the data analysis process, from data extraction and pre-processing to modeling, visualization, evaluation, deployment, and reporting, as well as automated process parameter optimization and process automation.

In addition to the new functionality, RapidMiner 4.3 significantly improves both the ease of use of the software and its scalability to very large data sets, and it speeds up process execution. RapidMiner 4.3 can be downloaded free of charge from the Rapid-I home page (www.rapid-i.com).

Besides the freely available RapidMiner 4.3 Community Edition, Rapid-I also offers the more powerful RapidMiner 4.3 Enterprise Edition. It extends the Community Edition with automated reporting in HTML, PDF, RTF, and Excel formats, parallel data mining operators for multi-core computers, and professional technical support with guaranteed response times. The third variant of RapidMiner, the Developer Edition (OEM license), allows other software vendors and solution providers to integrate RapidMiner as an ETL, BI, and data mining engine into their own products and solutions, even if these products are closed-source, and to distribute and sell these products and solutions to others.

The new RapidMiner release 4.3 focuses on improvements for companies that need to analyze large volumes of data and quickly use the results for decisions at the core of their business. The improved lift charts of RapidMiner 4.3 help to optimize direct mailing and marketing campaigns, customer churn prediction and reduction, customer retention, customer loyalty and sales volumes, and the acquisition of new customers with regard to cost, benefit, effectiveness, and efficiency.
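For readers unfamiliar with lift charts, the following minimal Python sketch illustrates the quantity such a chart plots. It is not RapidMiner code and does not reflect how RapidMiner computes its charts; the function name `cumulative_lift`, its parameters, and the sample data are assumptions made purely for illustration. Customers are ranked by a model's predicted response score, and for each top fraction of the ranking the response rate is divided by the overall response rate.

```python
def cumulative_lift(scores, responded, n_bins=10):
    """Return the cumulative lift for the top 1/n_bins, 2/n_bins, ... of customers.

    scores    -- predicted response scores, one per customer (hypothetical input)
    responded -- 1 if the customer actually responded, 0 otherwise
    """
    if len(scores) != len(responded) or not scores:
        raise ValueError("scores and responded must be non-empty and equally long")
    total_responders = sum(responded)
    if total_responders == 0:
        raise ValueError("lift is undefined when nobody responded")

    # Rank customers from the highest to the lowest predicted score.
    ranked = [r for _, r in sorted(zip(scores, responded),
                                   key=lambda pair: pair[0], reverse=True)]
    n = len(ranked)
    base_rate = total_responders / n  # response rate when mailing everyone

    lifts = []
    for b in range(1, n_bins + 1):
        cutoff = max(1, (n * b) // n_bins)        # size of the top segment
        segment_rate = sum(ranked[:cutoff]) / cutoff
        lifts.append(segment_rate / base_rate)    # lift of 1.0 means no gain
    return lifts


if __name__ == "__main__":
    # Invented sample data: ten customers, three of whom responded.
    scores = [0.9, 0.8, 0.7, 0.6, 0.5, 0.4, 0.3, 0.2, 0.1, 0.05]
    responded = [1, 1, 0, 1, 0, 0, 0, 0, 0, 0]
    for i, lift in enumerate(cumulative_lift(scores, responded, n_bins=5), start=1):
        print(f"top {i * 20}% of customers: lift {lift:.2f}")
```

A lift well above 1.0 for the top segments suggests that mailing only those customers would reach a disproportionate share of responders, which is the kind of cost and benefit trade-off a campaign planner reads from a lift chart.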

Modern data mining techniques help to select the customers most likely to respond positively to campaigns, the most promising communication channel(s) for reaching them, and the best way to contact and address them. Extended pivoting, new aggregation functions, extensive handling options for date and time attributes, simplified function-based construction of new features, optimized wizards that simplify the automated optimization of data mining process parameters, and new visualizations supporting zooming and panning improve data analysis and transformation processes and make RapidMiner easier to use. Extended macros and a new result storage mechanism support the design and execution of even more complex data mining processes. In addition to the database import and export already supported in previous versions (for example Oracle, IBM DB2, Microsoft SQL Server, MySQL, PostgreSQL, and Ingres) and the import of Excel and SPSS files, RapidMiner now also connects to Teradata data warehouses and makes it easier to connect to Microsoft Access databases. Univariate and multivariate time series analysis, prediction, and visualization have also been improved.

RapidMiner 4.3 also provides extensive capabilities for the analysis of unstructured texts, automated text classification and e-mail routing, text mining, web mining, and automated sentiment analysis of internet discussion groups, weblogs, product reviews, and customer feedback.

08.12.2008, Nadja Mierswa, Rapid-I GmbH