Paxata, Inc. provides adaptive data preparation platform that delivers raw data to ready data for business analysts in the United States and internationally. Its platform enables users to connect, explore, transform, and combine data on their own or work with peers in a shared and transparent environment as they shape data for analytics. The company also provides Data Time Machine, a cloud-based data governance solution that provides security, versioning, and usage visibility. Paxata, Inc. was founded in 2012 and is headquartered in Redwood City, California.
2317 Broadway Street
Redwood City, CA 94063
Founded in 2012
Paxata and Carahsoft Partner to Fill the Gap in Big Data Analytics Stack
Mar 12 15
Paxata announced its partnership with Carahsoft Technology Corp. to bring the award-winning Paxata Adaptive Data Preparation application and platform to government agencies to help them automate, collaborate and dynamically govern the data integration, data quality and enrichment process in a self-service fashion. Carahsoftjoins existing Paxata government partner In-Q-Tel, a not-for-profit, strategic investment firm that works to identify, adapt, and deliver innovative technology solutions to support the missions of the U.S. Intelligence Community, within the Paxata partner program to address the tremendous increase in demand from State, Local and Federal government organizations for a solution that addresses data preparation at scale.
Paxata, Inc. Unveils Platform Release of Adaptive Data Preparation at Scale
Oct 14 14
Paxata unveiled the general availability of its 2014 Fall release of its platform, which scales to support massive data volumes while maintaining easy-to-use data integration, quality and enrichment capabilities that reduce the time to analytics-ready data. The new Paxata release is offered with flexible deployment models for elastic multi-tenant public and private cloud with simple, scalable pricing. The Paxata solution has evolved from the first self-service data preparation application for all analysts, into a powerful in-memory enterprise data preparation platform built on top of Apache Spark(TM) which provides business-ready data at scale, addressing all analytic use cases including ad-hoc, packaged analytic applications, predictive analytics solutions and operational reporting. While Paxata's initial focus was solely on helping individual business analysts who used Microsoft Excel and SQL scripting and coding to address their basic data cleanup needs, organizations quickly found Paxata valuable across a broader range of their users including technical data analysts, data architects and data scientists. This drove an expanded feature set to address a broader set of use cases, as well as connectivity to a broader set of data sources, including Hadoop clusters, relational databases and Salesforce.com. The Fall 2014 release, a generally available release being demonstrated at Strata+Hadoop World, combines self-service data preparation easy enough for any business analyst to use, with an enterprise data preparation platform powerful enough to satisfy the demands of data architects and data scientists who want to dramatically increase their analytic productivity on ever-increasing data volumes. The new release provides a single unified platform for data integration, quality, enrichment, collaboration and governance, built on a high performance real time in-memory, columnar and distributed pipeline architecture. The platform is built upon the open source Apache Spark(TM) technology and provides native integration to Apache Hadoop. This integration enables Paxata to take advantage of Hadoop's ability to handle exponential growth of data with unprecedented performance of Apache Spark. This allows organization to easily scale up and down processing providing enterprises nearly unlimited capacity to support any kind of workload need. The new Paxata release of its enterprise data preparation platform includes architectural, performance, elasticity and connectivity enhancements, including: Architecture: The Paxata adaptive data preparation platform is powered by a real-time columnar parallelized pipeline architecture on top of Apache Spark(TM). Paxata provides a unified experience that allows for a seamless transition between interactive and batch execution models to advanced compilation and caching techniques. Performance: The Paxata platform is built to support exponential growth of data. The latest release supports interactive performance on multi-million row volumes and batch performance on significantly higher row volumes only limited by cluster size with scalability increasing linearly with additional cluster resources. In addition, the platform supports an increased number of concurrent user queries to the pipeline server with consistent response times. Elasticity: The Paxata platform easily scales up/down based on workload needs, with the ability to spin up new instances of the pipeline server in the public or private cloud to support massive workloads or spikes in demand, as well as route to specific pipeline server to support shared services model with variable workloads. This feature provides enterprises nearly unlimited capacity to support any kind of workload need in a highly cost efficient manner. Connectivity: Native publishing of Paxata AnswerSetsTM to Hadoop ecosystem SQL systems such as Cloudera Impala for interactive querying of large data sets or Apache HiveTM for batch querying of massive datasets with Parquet persistence. This allows any ODBC/JDBC compliant solution, including Tableau, Qlik and Excel to get business-ready data interactively at scale. JDBC import for RDBMS data sources, including Oracle, MS SQL, DB2, Postgres, MySQL, and Amazon Redshift; Native import and export to HDFS in any supported file format, including AVRO, Flat file, XML, JSON: Salesforce.com integration for direct access to CRM data. In addition to the platform enhancements, a significant number of new features have been added to the self-service data preparation application including: Emergent automation that automatically records analysts' steps and allows future replay; In-line preview, validation, and error handling of all step operations with columnar data lineage; Metaphone, ngram, and fingerprint-based fuzzy joins; Find and replace with contains search functionality and boolean calculations; Comprehensive end-to-end audit from the moment data is imported into the system to the moment it's exported; and Library functionality for sharing AnswerSets with other end-users powered by tagging and search.
Paxata Announces Appointments to the Advisory Board
Apr 2 14
Paxata announced the expansion of its Advisory Board, with the addition of Kenny Mendes, an innovator in quantitative talent acquisition at Box; Sanjay Poonen, General Manager of End-User Computing at VMware; and Ameet Patel, a noted advisor and investor in early stage and high growth technology companies looking to disrupt and revolutionize enterprise computing. The expanded Advisory Board will work closely with Paxata's team to ensure that Paxata's products and strategy continue to address the evolving needs of business analysts at all technical levels. Sanjay Poonen has more than 20 years of experience in the technology industry and has held a variety of management roles in engineering, products, sales, marketing and business development. Before joining VMware, Poonen was president and corporate officer of Platform Solutions and the Mobile Division at SAP AG. Kenny Mendes leads recruiting at Box, helping the disruptive brand grow its talent acquisition efforts to support explosive business growth. Ameet Patel has been a leader in multiple facets of the technology, financial services and investment industries, having played key roles from a customer's perspective in IT as Chief Architect of Chase's consumer business and a CTO at JPMorgan Chase, participated on the investment banking side as a strategic advisor and head of LabMorgan, and now is focusing on advising and investing in early stage and high growth technology companies.