Johannesburg, 15 Feb 2011
EMC Corporation, the world leader in information infrastructure solutions, has introduced a free Community Edition of the EMC Greenplum database, the industry-leading, high-performance massively parallel processing (MPP) database product, along with free analytic algorithms and data mining tools. Free downloads of the announcement are available at http://community.greenplum.com.
Building on earlier Greenplum 'Big Data' breakthroughs, like the EMC Greenplum Data Computing Appliance, the new EMC Greenplum Community Edition removes the cost barrier to entry for big data power tools empowering large numbers of developers, data scientists, and other data professionals. This free set of tools enables the community to not only better understand their data, gain deeper insights and better visualise insights, but to also contribute and participate in the development of next-generation tools and solutions.
With the Community Edition stack, developers can build complex applications to collect, analyse and operationalise big data, thereby leveraging best-of-breed big data tools, including the Greenplum database with its in-database analytic processing capabilities.
Luke Lonergan, CTO and vice-president, EMC Data Computing Products Division and co-founder of Greenplum, explained: “Our new Community Edition provides a parallel-everything Big Data stack with unequalled speed which enables analysts to perform next-generation data analytics and experiment with real-world data, and most importantly - innovate. This project is about empowering developers - they can program using the most popular tools and they have a place to contribute open source extensions to the stack.”
The free EMC Greenplum Community Edition includes:
1) Greenplum Database CE, an industry-leading massively parallel processing (MPP) database product for large-scale analytics and next-generation data warehousing.
2) MADlib, the open source analytic algorithms library, providing data-parallel implementations of mathematical, statistical and machine learning methods for structured and unstructured data.
3) Alpine Miner, an up-and-coming third-party analytics tool, an intuitive visual data mining modeller that delivers rapid 'modelling to scoring capabilities, leverages in-database analytics, and is purpose-built for big data applications.
Community benefits
The initial release of the EMC Greenplum Community Edition is designed for both first-time users and experienced Greenplum customers. First-time users gain access to a comprehensive, purpose-built business analytics environment that enables them to view, modify and enhance demo data files, enabling experimentation with Big Data analytical tools within the Greenplum database. Existing users can download an upgraded version of Greenplum Database CE and analytic tools for integration into their development and research environments.
The Community Edition can be downloaded as a pre-configured VMware virtual appliance for use on laptops and desktops, or as a set of packages for deployment on user machines. All users are free to participate in new Greenplum Community Forums to get support, collaborate, post ideas, and test enhancements developed by various users independently.
Availability:
Starting 1 February 2011, the EMC Greenplum Community Edition can be downloaded free of charge from http://community.greenplum.com. Regular Community Edition updates will be made available online. The Community Edition is intended for experimentation, development and research purposes only. Current Single-Node Edition users can deploy the new Community Edition in their single-node production environments. Greenplum commercial licences must be purchased prior to using code for internal data processing or for any commercial or production purpose.
Share