Concurrent Completes the Big Data Hat Trick for Hadoop ApplicationsMay 22, 2013 No Comments
SAN FRANCISCO – May 21, 2013 – Concurrent, Inc., the enterprise Big Data application platform company, today introduced Pattern, a free, open source, standard-based scoring engine that enables analysts and data scientists to quickly deploy machine-learning applications on Apache Hadoop™. Leveraging the power and broad platform support of the Cascading application framework, Pattern lowers the barrier to Hadoop adoption by enabling companies to leverage existing intellectual property (IP) in predictive models, existing investments in software tooling and the core competencies of existing analytics staff to run Big Data applications from existing machine-learning models using Predictive Model Markup Language (PMML) or through a simple programming interface.
Hadoop is rapidly becoming the tool of choice for tackling enterprise Big Data analytics needs in an effort to make the most of growing volumes of unstructured and semi-structured data. The need for Hadoop to easily integrate with existing data management and analytics systems, however, has created a real barrier to comprehensive Hadoop adoption.
Enter Pattern: PMML for Cascading and Hadoop
With the introduction of Pattern, companies can now leverage existing skill sets, core competencies and product investments by carrying them over to Hadoop via the standards-based PMML technology. PMML is the standard export format for tools, such as R, MicroStrategies® and SAS®; and with Pattern, analysts and data scientists familiar with these technologies can now run predictive data models at scale and integrate ETL, data preparation and predictive analytics in the same application to greatly reduce development time and unlock accessibility to large Hadoop data sets. Pattern in turn will enable a whole new class of use cases and simplify experiments.
Pattern runs on Cascading, the most widely used and deployed application framework for building robust, enterprise Big Data applications on Hadoop. Recognized companies, including The Climate Corporation, eBay, Etsy, FlightCaster, iCrossing, Razorfish, Trulia, TeleNav and Twitter, are using Cascading to streamline data processing, data filtering and workflow optimization for large volumes of unstructured and semi-structured data. Cascading is also at the core of popular language extensions including PyCascading (Python + Cascading), Scalding (Scala + Cascading) and Cascalog (Clojure + Cascading) – open source projects sponsored by Twitter. Cascading has become the most reliable and repeatable way of building and deploying Big Data applications.
By leveraging the Cascading framework, enterprises can apply Java, SQL and predictive modeling investments, and combine the respective outputs of multiple departments into a single application on Hadoop. This is a powerful step forward in delivering on the full promise of the business of Big Data.
“Pattern facilitates AgilOne to deploy a variety of advanced machine-learning algorithms for our cloud-based predictive marketing intelligence solution. As a self-service SaaS offering, Pattern allows us to evaluate multiple models and push the clients’ best models into our high performance scoring system. The PMML interface allows our advanced clients to deploy custom models.”
-Antony Arokiasamy, Senior Software Architect, AgilOne
“Concurrent is tearing down barriers for mass Hadoop adoption. With Pattern, we have cleared another path by enabling data scientists to more easily bring their work to production. When combined, Cascading, Lingual and Pattern close the modeling, development and production loop for all data oriented applications. The combination of the three is the application ensemble for further enabling enterprises to drive differentiation through data.”
-Chris Wensel, CTO and Founder, Concurrent, Inc.
Pattern website: http://www.cascading.org/pattern
Cascading website: http://www.cascading.org
Lingual website: http://www.cascading.org/lingual
Contact Us: http://www.concurrentinc.com/contact
Follow us on Twitter: http://twitter.com/concurrent
Availability and Pricing
Pattern is a free, open source software and available under the Apache 2.0 License. To learn more about the Pattern project, visit http://www.cascading.org/pattern. Concurrent also offers standard and premium support subscriptions for enterprise use. To learn more about Concurrent’s offerings, please visit http://www.concurrentinc.com.
About Concurrent, Inc.
Concurrent, Inc.’s vision is to become the #1 software platform choice for Big Data applications. Concurrent builds application infrastructure products that are designed to help enterprises create, deploy, run and manage data processing applications at scale on Apache Hadoop.
Concurrent is the mind behind Cascading™, the most widely used and deployed technology for Big Data applications with more than 75,000+ user downloads a month. Used by thousands of data driven businesses including Twitter, eBay, The Climate Corp and Etsy, Cascading is the de-facto standard in open source application infrastructure technology. Concurrent is headquartered in San Francisco. Visit Concurrent online at http://www.concurrentinc.com.DATA and ANALYTICS , News