Skip to main content

Startup Showcase finalists at Strata + Hadoop World 2013

Strata Conference + Hadoop World 2013 is now underway at New York City from 28th October 2013 onwards. We profile the finalists at the startup showcase event. The selections this time around are focused primarily on analytics providing insights of variant nature. You may be surprised or wowed at how Hadoop and related technologies can be used to fortify cyber security, provide location intelligence, improve supply chain or for that matter, just do behavior analysis of your child.


Affinio Inc.:

Affinio is an advanced NoSQL database technology enabling low-cost, real-time processing of social network graphs to determine how every person on the web is connected.
Affinio's first product, Tactics Cloud is a social network insights platform for marketing, advertising, sales, BD and BI teams that want to gain actionable insights into the social networks of their leads, customers, and competitors.
Founded:2013
Main Location:Canada
Employees:1-10 employees
http://www.affin.io

Alpine Data Labs:

Alpine claims to offer the world’s first script-less and web-based advanced analytics software for Big Data and Hadoop. With Alpine, Data Scientists and Business Analysts can work with large data sets, develop, refine models and collaborate at scale without having to use code or download software.
Founded:2010
Main Location:San Mateo, CA, USA
Employees:11-50 employees
http://www.alpinedatalabs.com

Appuri, Inc:

Appuri is the Big Data Stack for Customer Engagement at Scale. Appuri is a hosted smart data pipeline that loads your data warehouse and provides three things:unified profile of every Customer by collecting data from internal and external sources; real-time visualizations that give a holistic view of customer behavior; and prescriptive actions that orchestrate internal and external systems to optimize engagement.
Founded:2012
Main Location:Seattle, WA, USA
Employees:1-10 employees
http://www.appuri.com

Fortscale:

Fortscale’s goal is to enable enterprises to easily run big data analytics for cyber security, regardless of their technical know-how. The company strives to improve cyber security teams’ effectiveness by delivering risk-prioritized analysis and visualization of user behavior and access activity.
Founded:2012
Main Location:Tel Aviv, Israel
Employees:10-50 employees
http://www.fortscale.com

Hazelcast:

Hazelcast is an open source clustering and highly scalable caching solution for in-memory data grid. It allows to easily cache, share application state and partition application data across the cluster. Hazelcast is a peer-to-peer solution (there is no master node, every node is a peer) so there is no single point of failure. Applications running Hazelcast will dynamically cluster and create a single system view. Hazelcast also provides distributed executor service to parallelize the processing of distributed data.
Founded:2008
Main Location:Istanbul, Turkey
Employees:11-50 employees
http://www.hazelcast.com/

Inovae / DataToWeb:

DataToWeb is the Internet of Things monitoring platform that allows individuals and organizations to collect, store and analyze their time series data online, like for energy analytics.
Founded:2008
Main Location:Canada
Employees:1-10 employees
https://www.datatoweb.com

JethroData:

JethroData’s jetBase is a patent pending next generation bigdata analytics database, that is built natively into Hadoop’s distributed file system and combines the affordability and scalability of Hadoop’s storage with the power of a fully indexed columnar database. It revolutionizes the way companies analyze Big Data, by introducing a database technology that addresses both the need to store vast amounts of raw data and the need for lightning fast queries. Running natively in Hadoop, JethroData enables organizations to use a single system to store and analyze data in real-time, and combines the scalability of Hadoop storage with the performance of an analytic DB.   

JethroData also featured in the earlier edition of Strata conference Hadoop World in Feb 2013.
Founded:2012
Main Location:Netanya, Israel
Employees:1-10 employees
http://www.jethrodata.com

Metric Insights:

Metric Insights bridges the last mile to business intelligence and Big Data.Metric Insights lets users cut through the noise, focus immediately on the critical business issues that warrant their attention, and take action. Its Push Intelligence platform connects quickly and easily to existing business intelligence tools, big data and SaaS applications. It uniquely delivers a patented KPI warehouse, collaboration and notification technologies that tell you when your key business metrics have changed, and, more importantly, why.
Founded:2010
Main Location:San Francisco, CA, USA
Employees:1-10 employees
http://www.metricinsights.com

Mevoked:

Mevoked has developed a unique platform that helps to  determine the emotional state of mind of the user based on their browsing patterns/content consumed. They are using this technology to develop a behavior monitoring tool for children ages 6 - 17 and use it as an early warning tool for parents and educators. Unlike normal parental controls which block content, Mevoked is creating an environment where parents can be alerted to signs of depression and/or negative behavior/moods in real-time. The goal is to create a positive reinforcement loop starting by informing parents/educators/guardians about their children’s state of mind.
Founded:2012
Main Location:San Francisco, CA, USA
Employees:1-10 employees
http://www.mevoked.com

PlaceIQ:

PlaceIQ claims to be a leading provider of location intelligence, enabling advertisers to reach and define mobile brand audiences at scale for a wide range of marketing activities. Working with agencies, brands, and channel partners, PlaceIQ deploys proprietary, patent-pending big data science to aggregate and analyze extensive amounts of location data from multiple sources. The resulting intelligence gives marketers an unprecedented understanding of consumer behavior, while offering a privacy-friendly way to define, locate, reach and measure mobile audiences.
Founded:2010
Main Location:New York, USA
Employees:51-200 employees
http://www.placeiq.com

Sqrrl:

Sqrrl is a Big Data company that provides products for Apache Accumulo. Sqrrl is focused on Big Data challenges both in terms of scale and analytic complexity where data is constrained by legal, regulatory, or other restrictions. Its products promote data integration, secure information sharing, and data-driven solutions. Their core focus is on secure, scalable, and and adaptable Big Data platform built on Apache Accumulo.
Founded:2012
Main Location:Cambridge, MA, USA
Employees:11-50 employees
http://www.sqrrl.com

WebKite:

WebKite transforms data into a dynamic, interactive, intuitive website. Servicing businesses small and large, WebKite is able to take information and turn it into the flexible and customizable content management platform.WebKite is built on the foundation of tools that can help a business in four distinct areas: user empowerment, community building, website growth, and brand alignment. The WebKite platform empowers users to educate their community with built-in decision tools such as multi-facet search and side-by-side comparisons.
Founded:2012
Main Location:Pittsburgh, PA, USA
Employees:11-50 employees
http://www.webkite.com/


Comments

Popular posts from this blog

Data deduplication tactics with HDFS and MapReduce

As the amount of data continues to grow exponentially, there has been increased focus on stored data reduction methods. Data compression, single instance store and data deduplication are among the common techniques employed for stored data reduction.
Deduplication often refers to elimination of redundant subfiles (also known as chunks, blocks, or extents). Unlike compression, data is not changed and eliminates storage capacity for identical data. Data deduplication offers significant advantage in terms of reduction in storage, network bandwidth and promises increased scalability.
From a simplistic use case perspective, we can see application in removing duplicates in Call Detail Record (CDR) for a Telecom carrier. Similarly, we may apply the technique to optimize on network traffic carrying the same data packets.
Some of the common methods for data deduplication in storage architecture include hashing, binary comparison and delta differencing. In this post, we focus on how MapReduce and…

5 online tools in data visualization playground

While building up an analytics dashboard, one of the major decision points is regarding the type of charts and graphs that would provide better insight into the data. To avoid a lot of re-work later, it makes sense to try the various chart options during the requirement and design phase. It is probably a well known myth that existing tool options in any product can serve all the user requirements with just minor configuration changes. We all know and realize that code needs to be written to serve each customer’s individual needs.
To that effect, here are 5 tools that could empower your technical and business teams to decide on visualization options during the requirement phase. Listed below are online tools for you to add data and use as playground.
1)      Many Eyes: Many Eyes is a data visualization experiment by IBM Researchandthe IBM Cognos software group. This tool provides option to upload data sets and create visualizations including Scatter Plot, Tree Map, Tag/Word cloud and ge…

Pricing models for Hadoop products

A look at the various pricing models adopted by the vendors in the Hadoop ecosystem. While the pricing models are evolving in this rapid and dynamic market, listed below are some of the major variations utilized by companies in the sphere.
1) Per Node:Among the most common model, the node based pricing mechanism utilizes customized rules for determining pricing per node. This may be as straight forward as pricing per name node and data node or could have complex variants of pricing based on number of core processors utilized by the nodes in the cluster or per user license in case of applications.
2) Per TB:The data based pricing mechanism charges customer for license cost per TB of data. This model usually accounts non replicated data for computation of cost.
3) Subscription Support cost only:In this model, the vendor prefers to give away software for free but charges the customer for subscription support on a specified number of nodes. The support timings and level of support further …