Skip to main content

Startup Showcase finalists at Strata + Hadoop World 2013

Strata Conference + Hadoop World 2013 is now underway at New York City from 28th October 2013 onwards. We profile the finalists at the startup showcase event. The selections this time around are focused primarily on analytics providing insights of variant nature. You may be surprised or wowed at how Hadoop and related technologies can be used to fortify cyber security, provide location intelligence, improve supply chain or for that matter, just do behavior analysis of your child.


Affinio Inc.:

Affinio is an advanced NoSQL database technology enabling low-cost, real-time processing of social network graphs to determine how every person on the web is connected.
Affinio's first product, Tactics Cloud is a social network insights platform for marketing, advertising, sales, BD and BI teams that want to gain actionable insights into the social networks of their leads, customers, and competitors.
Founded:2013
Main Location:Canada
Employees:1-10 employees
http://www.affin.io

Alpine Data Labs:

Alpine claims to offer the world’s first script-less and web-based advanced analytics software for Big Data and Hadoop. With Alpine, Data Scientists and Business Analysts can work with large data sets, develop, refine models and collaborate at scale without having to use code or download software.
Founded:2010
Main Location:San Mateo, CA, USA
Employees:11-50 employees
http://www.alpinedatalabs.com

Appuri, Inc:

Appuri is the Big Data Stack for Customer Engagement at Scale. Appuri is a hosted smart data pipeline that loads your data warehouse and provides three things:unified profile of every Customer by collecting data from internal and external sources; real-time visualizations that give a holistic view of customer behavior; and prescriptive actions that orchestrate internal and external systems to optimize engagement.
Founded:2012
Main Location:Seattle, WA, USA
Employees:1-10 employees
http://www.appuri.com

Fortscale:

Fortscale’s goal is to enable enterprises to easily run big data analytics for cyber security, regardless of their technical know-how. The company strives to improve cyber security teams’ effectiveness by delivering risk-prioritized analysis and visualization of user behavior and access activity.
Founded:2012
Main Location:Tel Aviv, Israel
Employees:10-50 employees
http://www.fortscale.com

Hazelcast:

Hazelcast is an open source clustering and highly scalable caching solution for in-memory data grid. It allows to easily cache, share application state and partition application data across the cluster. Hazelcast is a peer-to-peer solution (there is no master node, every node is a peer) so there is no single point of failure. Applications running Hazelcast will dynamically cluster and create a single system view. Hazelcast also provides distributed executor service to parallelize the processing of distributed data.
Founded:2008
Main Location:Istanbul, Turkey
Employees:11-50 employees
http://www.hazelcast.com/

Inovae / DataToWeb:

DataToWeb is the Internet of Things monitoring platform that allows individuals and organizations to collect, store and analyze their time series data online, like for energy analytics.
Founded:2008
Main Location:Canada
Employees:1-10 employees
https://www.datatoweb.com

JethroData:

JethroData’s jetBase is a patent pending next generation bigdata analytics database, that is built natively into Hadoop’s distributed file system and combines the affordability and scalability of Hadoop’s storage with the power of a fully indexed columnar database. It revolutionizes the way companies analyze Big Data, by introducing a database technology that addresses both the need to store vast amounts of raw data and the need for lightning fast queries. Running natively in Hadoop, JethroData enables organizations to use a single system to store and analyze data in real-time, and combines the scalability of Hadoop storage with the performance of an analytic DB.   

JethroData also featured in the earlier edition of Strata conference Hadoop World in Feb 2013.
Founded:2012
Main Location:Netanya, Israel
Employees:1-10 employees
http://www.jethrodata.com

Metric Insights:

Metric Insights bridges the last mile to business intelligence and Big Data.Metric Insights lets users cut through the noise, focus immediately on the critical business issues that warrant their attention, and take action. Its Push Intelligence platform connects quickly and easily to existing business intelligence tools, big data and SaaS applications. It uniquely delivers a patented KPI warehouse, collaboration and notification technologies that tell you when your key business metrics have changed, and, more importantly, why.
Founded:2010
Main Location:San Francisco, CA, USA
Employees:1-10 employees
http://www.metricinsights.com

Mevoked:

Mevoked has developed a unique platform that helps to  determine the emotional state of mind of the user based on their browsing patterns/content consumed. They are using this technology to develop a behavior monitoring tool for children ages 6 - 17 and use it as an early warning tool for parents and educators. Unlike normal parental controls which block content, Mevoked is creating an environment where parents can be alerted to signs of depression and/or negative behavior/moods in real-time. The goal is to create a positive reinforcement loop starting by informing parents/educators/guardians about their children’s state of mind.
Founded:2012
Main Location:San Francisco, CA, USA
Employees:1-10 employees
http://www.mevoked.com

PlaceIQ:

PlaceIQ claims to be a leading provider of location intelligence, enabling advertisers to reach and define mobile brand audiences at scale for a wide range of marketing activities. Working with agencies, brands, and channel partners, PlaceIQ deploys proprietary, patent-pending big data science to aggregate and analyze extensive amounts of location data from multiple sources. The resulting intelligence gives marketers an unprecedented understanding of consumer behavior, while offering a privacy-friendly way to define, locate, reach and measure mobile audiences.
Founded:2010
Main Location:New York, USA
Employees:51-200 employees
http://www.placeiq.com

Sqrrl:

Sqrrl is a Big Data company that provides products for Apache Accumulo. Sqrrl is focused on Big Data challenges both in terms of scale and analytic complexity where data is constrained by legal, regulatory, or other restrictions. Its products promote data integration, secure information sharing, and data-driven solutions. Their core focus is on secure, scalable, and and adaptable Big Data platform built on Apache Accumulo.
Founded:2012
Main Location:Cambridge, MA, USA
Employees:11-50 employees
http://www.sqrrl.com

WebKite:

WebKite transforms data into a dynamic, interactive, intuitive website. Servicing businesses small and large, WebKite is able to take information and turn it into the flexible and customizable content management platform.WebKite is built on the foundation of tools that can help a business in four distinct areas: user empowerment, community building, website growth, and brand alignment. The WebKite platform empowers users to educate their community with built-in decision tools such as multi-facet search and side-by-side comparisons.
Founded:2012
Main Location:Pittsburgh, PA, USA
Employees:11-50 employees
http://www.webkite.com/


Comments

Popular posts from this blog

Data deduplication tactics with HDFS and MapReduce

As the amount of data continues to grow exponentially, there has been increased focus on stored data reduction methods. Data compression, single instance store and data deduplication are among the common techniques employed for stored data reduction.
Deduplication often refers to elimination of redundant subfiles (also known as chunks, blocks, or extents). Unlike compression, data is not changed and eliminates storage capacity for identical data. Data deduplication offers significant advantage in terms of reduction in storage, network bandwidth and promises increased scalability.
From a simplistic use case perspective, we can see application in removing duplicates in Call Detail Record (CDR) for a Telecom carrier. Similarly, we may apply the technique to optimize on network traffic carrying the same data packets.
Some of the common methods for data deduplication in storage architecture include hashing, binary comparison and delta differencing. In this post, we focus on how MapReduce and…

In-memory data model with Apache Gora

Open source in-memory data model and persistence for big data framework Apache Gora™ version 0.3, was released in May 2013. The 0.3 release offers significant improvements and changes to a number of modules including a number of bug fixes. However, what may be of significant interest to the DynamoDB community will be the addition of a gora-dynamodb datastore for mapping and persisting objects to Amazon's DynamoDB. Additionally the release includes various improvements to the gora-core and gora-cassandra modules as well as a new Web Services API implementation which enables users to extend Gora to any cloud storage platform of their choice. This 2-part post provides commentary on all of the above and a whole lot more, expanding to cover where Gora fits in within the NoSQL and Big Data space, the development challenges and features which have been baked into Gora 0.3 and finally what we have on the road map for the 0.4 development drive.
Introducing Apache Gora Although there are var…

Large scale graph processing with Apache Hama

Recently Apache Hama team released official 0.7.0 version. According to the release announcement, there were big improvements in Graph package. In this article, we provide an overview of the newly improved Graph package of Apache Hama, and the benchmark results that performed by cloud platform team at Samsung Electronics.

Large scale datasets are being increasingly used in many fields. Graph algorithms are becoming important for analyzing big data. Data scientists are able to predict the behavior of the customer, the trends of the market, and make a decision by analyzing the graph structure and characteristics. Currently there are a variety of open source graph analytic frameworks, such as Google’s Pregel[1], Apache Giraph[2], GraphLab[3] and GraphX[4]. These frameworks are aimed at computations varying from classical graph traversal algorithms to graph statistics calculations such as triangle counting to complex machine learning algorithms. However these frameworks have been developed…