News & Insights

Blumberg Capital portfolio news, startup growth resources and industry insights

Home > News & Insights > Big Data File Formats Demystified

Data Analytics & Infrastructure Enterprise Software

Big Data File Formats Demystified

May 16, 2018

By Alex Woodie

So you’re filling your Hadoop cluster with reams of raw data, and your data analysts and scientists are champing at the bit to get started. Then the question hits you: How are you going to store all this data so they can actually use it?

The good news is Hadoop is one of the most cost-effective ways to store huge amounts of data. You can store all types of structured, semi-structure, and unstructured data within the Hadoop Distributed File System, and process it in a variety of ways using Hive, HBase, Spark, and many other engines.

News & Insights

Big Data File Formats Demystified

Svexa and Zone7 merge operations to offer an end-to-end AI suite of performance tools

Prescient AI raises $10M to help omnichannel brands optimize ad spend and maximize revenue

PerfectScale raises $7.1 million for its Kubernetes optimization platform

Kloudfuse Launches Out of Stealth with $23M

Investing in PerfectScale and the Transformation of Kubernetes Optimization

Nexla has Defied Startup Conventional Wisdom with Slow and Steady Growth

WorkJam Named to Time’s List of the TIME100 Most Influential Companies

The Information: Hands Up, This Is a (Virtual) Robbery!