We have data on 13,558 companies that use Apache Spark. The companies using Apache Spark are most often found in United States and in the Computer Software industry. Apache Spark is most often used by companies with 50-200 employees and 1M-10M dollars in revenue. Our data for Apache Spark usage goes back as far as 5 years and 3 months.
We use the best indexing techniques combined with advanced data science to monitor the market share of over 15,000 technology products, including Big Data. By scanning billions of public documents, we are able to collect deep insights on every company, with over 100 data fields per company at an average. In the Big Data category, Apache Spark has a market share of about 3.0%. Other major and competing products in this category include:
Apache Spark is an open source cluster computing framework. Originally developed at the University of California, Berkeley's AMPLab, the Spark codebase was later donated to the Apache Software Foundation that has maintained it since. Spark provides an interface for programming entire clusters with implicit data parallelism and fault-tolerance.
Looking at Apache Spark customers by industry, we find that Computer Software (32%) and Information Technology and Services (14%) are the largest segments.
59% of Apache Spark customers are in United States, 6% are in United Kingdom and 6% are in India.
Of all the customers that are using Apache Spark, 32% are small (<50 employees), 42% are medium-sized and 26% are large (>1000 employees).
Of all the customers that are using Apache Spark, a majority (59%) are small (<$50M), 24% are large (>$1000M) and 9% are medium-sized.