EDGAR classifies the reporting companies by SIC (Standard Industrial Classification) code. We can use this information to calculate the total sales per sector and then each company's percentage share of its sector's total.
This helps us identify the companies with a large market share.
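To make the calculation concrete, here is a minimal sketch in plain Scala. It does not use Spark or the Smart EDGAR API: the `CompanySales` and `MarketShare` case classes and the `MarketShares.marketShares` function are hypothetical names introduced only for illustration, and the input is assumed to be one sales figure per company, already tagged with its SIC code.

```scala
// Hypothetical record: one company's sales figure together with its SIC code.
final case class CompanySales(company: String, sicCode: String, sales: BigDecimal)

// Hypothetical result: a company's share of its sector's total sales, in percent.
final case class MarketShare(company: String, sicCode: String, sharePercent: BigDecimal)

object MarketShares {
  def marketShares(data: Seq[CompanySales]): Seq[MarketShare] = {
    // Step 1: total sales per sector (SIC code).
    val totalBySector: Map[String, BigDecimal] =
      data.groupBy(_.sicCode).map { case (sic, rows) => sic -> rows.map(_.sales).sum }

    // Step 2: each company's share of its sector total, largest share first.
    data
      .filter(row => totalBySector(row.sicCode) > 0) // skip sectors with no sales to avoid dividing by zero
      .map { row =>
        val share = row.sales * 100 / totalBySector(row.sicCode)
        MarketShare(row.company, row.sicCode, share.setScale(2, BigDecimal.RoundingMode.HALF_UP))
      }
      .sortBy(-_.sharePercent)
  }
}
```

Calling `MarketShares.marketShares` with a sequence of `CompanySales` records returns one `MarketShare` per company, so the companies with the largest share of their sector appear at the top of the result.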
The result (which uses Spark) can be found in the following Gist. Alternatively, here is a version that relies purely on Scala and Smart EDGAR.