Skip to main content

Startup Showcase finalists at Strata + Hadoop World 2012

A quick look at Startup Showcase Finalist Companies at this year's O'Reilly Strata Conference and Hadoop World to be held in New York City on Oct 23rd, 2012.

Let's glance through: 



Location: Bucharest, Romania
Founded: 2012
At Axemblr the focus is on building the best service that can deploy Apache Hadoop on-demand on your cloud of choice, either public or private. 
On top of that service they are building a marketplace for algorithms, queries and data sets that you can run on-demand. Simplicity and robustness are key characteristic.



Location: Montreal, Canada
Founded: February 1, 2010
Funding: $2M
Datacratic is software company that applies real-time machine learning and predictive modeling to big data generated from consumer behavior. Datacratic provides optimization for Data Management Platforms (DMPs), Demand Side Platforms (DSPs), Agency Trading Desks (ATDs), e-commerce web sites and others in the real-time marketing ecosystem. Originally founded as Recoset in February 2010 and headquartered in Montreal, Datacratic is privately held with backing from Real Ventures and BDC Venture Capital. 



Location: Cambridge, Massachusetts, United States
Founded: 2010
Funding: $9.5M
Hadapt offers an adaptive analytical platform for performing complex analytics on structured and unstructured data, all in one cloud-optimized system. For more information look at HadoopSphere's post earlier on Hadapt.



Location: Chicago, Illinois, United States
Founded: 2012
Infoactive makes it super easy to turn your data into interactive graphics that shine. It is a Chicago startup providing an online platform for creating interactive, mobile-friendly infographics. InfoActive will help people that have “an interest in visualizing data but don’t know where to start, or those who think it’s difficult to create beautiful, data-driven stories.” 

Mortar Data


Location: New York, New York, United States
Founded: June, 2010
Funding: $1.8M
Mortar Data's Hadoop PaaS gets software engineers and data scientists up and running in an hour with no special training. Mortar makes custom big data work exceptionally easy by leveraging existing skills and tools. Using APIs, Mortar is easy to integrate into your data pipeline. And Mortar is a cloud service, so you pay for just what you need, never hit a ceiling, and never need to worry about scaling again.

Next Big Sound


Location: Nashville, Tennessee, United States
Founded: June 20, 2008
Funding: $7.5M
Launched in 2009, Next Big Sound is the leading provider of online music analytics and insights, tracking hundreds of thousands of artists across all major web properties (YouTube, MySpace, Facebook, Twitter, Wikipedia, etc.). For the first time, people can track and artist’s .com traffic, unit sales and radio data along side social media networks to measure results and gain actionable insight from activity, both online and off. The Next Big Sound Premier platform allows the measurement and granular visualization of fan density, sales and site traffic in powerful graphs, by geographic region, as well as weekly email reports to help inform decisions. Next Big Sound works with a variety of businesses ranging 
from individual artists and managers to the biggest record labels in the world.



Location: Jersey City, New Jersey, United States
Funding: $2.5M
NGDATA is the consumer intelligence company that empowers enterprises seeking greater customer lifetime value by enabling deep customer insights, personalized product offers and intimate customer experience to drive sales, and increase customer loyalty with a unique combination of interactive Big Data management and machine learning technologies in one integrated solution.



Location: Seattle, Washington, United States
Founded: January 19, 2011
Funding: $3.4M
Placed is Location Analytics. By connecting the physical and digital worlds, Placed is creating a new class of analytics focused on location. Placed Analytics, a free service, anonymously measures, aggregates, and analyzes the paths and places people visit in the physical world. Founded in February 2011, Placed is headquartered in Seattle and is backed by Madrona Venture Group.



Location: Boulder, Colorado, United States
Founded: October 25, 2010
Funding: $2.77M
Precog is a data analysis platform that helps companies productize their data assets. Precog allows companies to store, integrate, and analyze large volumes of measured data. Featuring the industry-leading implementation of Quirrel, the “R for big data” statistical language, the Precog platform allows companies to create analytics or predictive models that can be deployed as new products or features inside existing products.
Companies use Precog to build analytics and reporting into their applications, to turn existing data assets into new data products that can then be resold, and to build advanced analytical features like recommendations and personalizations.

Privacy Analytics 


Location: Ottawa, Canada
Founded: 2007
Privacy Analytics offers HIPAA de-identification certification services along with a unique data masking and de-identification tool. 

Privacy Analytics consulting services include: 
1. Conducting re-identification risk assessments on actual databases 
2. Performing re-identification attacks on de-identified data sets 
3. Developing guidelines and recommendations for de-identifying data 
4. De-identifying data sets and certifying the level of risk


Location: Virginia, United States
Founded: 2012
RedCarp's product, Iceberg Engine, provides outstanding analysis for talent management with informative dashboard and real-time insights. Employees provide answers from a smart phone, tablet, or PC. Iceberg Engine processes, transforms, and analyzes data collected from employees and organizations.



Location: Chicago, United States
Founded: 2012
Tempo is the database service for time-series data. We make it possible to store and instantly analyze the massive streams of measured data that break traditional databases. Tempo was a member of the inaugural TechStars Cloud class of 2012, a startup accelerator focused on internet infrastructure.



Location: Paris, France
Founded: May 10, 2011
Funding: $500k
qunb is a one-stop-shop for figures. Its mission is to provide a simple search engine for everything stats-, data- or figures-related to its users.qunb agregates data from thousands of sources, and makes them accessible to potentially everyone. Using a search bar and simple drag & drop interface, qunb provides beautiful, customizable, and easy to export graphs on any topics of interest.



Location: San Francisco, California, United States
Funding: $5M
WibiData, the big data management startup. Bisciglia says that customers can use the data analysis platform to create personalized products and services. And the technology easily integrates with business intelligence and database offerings.

Information sourced from CrunchBase and LinkedIn

And the winners are:

Tim O’Reilly’s pick: Privacy Analytics

Fred Wilson’s pick: Placed

Audience Choice: Hadapt

Audience Choice Runner-up: Precog

Update posted on 29 Oct:
1 - Infoactive information corrected
2 - Winners listed


  1. Think you got the wrong InfoActive in this article. The domain is and it's a completely different company.

  2. Thank you for pointing to the right url. This also points to the importance of having a unique domain name and company name to avoid conflict with similar business.

    Regardless, Infoactive sounds promising - the pitch almost sounds like a Kickstarter analytics project.


Post a Comment

Popular articles

5 online tools in data visualization playground

While building up an analytics dashboard, one of the major decision points is regarding the type of charts and graphs that would provide better insight into the data. To avoid a lot of re-work later, it makes sense to try the various chart options during the requirement and design phase. It is probably a well known myth that existing tool options in any product can serve all the user requirements with just minor configuration changes. We all know and realize that code needs to be written to serve each customer’s individual needs. To that effect, here are 5 tools that could empower your technical and business teams to decide on visualization options during the requirement phase. Listed below are online tools for you to add data and use as playground. 1)      Many Eyes : Many Eyes is a data visualization experiment by IBM Research and the IBM Cognos software group. This tool provides option to upload data sets and create visualizations including Scatter Plot, Tree Ma

Data deduplication tactics with HDFS and MapReduce

As the amount of data continues to grow exponentially, there has been increased focus on stored data reduction methods. Data compression, single instance store and data deduplication are among the common techniques employed for stored data reduction. Deduplication often refers to elimination of redundant subfiles (also known as chunks, blocks, or extents). Unlike compression, data is not changed and eliminates storage capacity for identical data. Data deduplication offers significant advantage in terms of reduction in storage, network bandwidth and promises increased scalability. From a simplistic use case perspective, we can see application in removing duplicates in Call Detail Record (CDR) for a Telecom carrier. Similarly, we may apply the technique to optimize on network traffic carrying the same data packets. Some of the common methods for data deduplication in storage architecture include hashing, binary comparison and delta differencing. In this post, we focus o

In-memory data model with Apache Gora

Open source in-memory data model and persistence for big data framework Apache Gora™ version 0.3, was released in May 2013. The 0.3 release offers significant improvements and changes to a number of modules including a number of bug fixes. However, what may be of significant interest to the DynamoDB community will be the addition of a gora-dynamodb datastore for mapping and persisting objects to Amazon's DynamoDB . Additionally the release includes various improvements to the gora-core and gora-cassandra modules as well as a new Web Services API implementation which enables users to extend Gora to any cloud storage platform of their choice. This 2-part post provides commentary on all of the above and a whole lot more, expanding to cover where Gora fits in within the NoSQL and Big Data space, the development challenges and features which have been baked into Gora 0.3 and finally what we have on the road map for the 0.4 development drive. Introducing Apache Gora Although