Indicators on firm level innovation activities from web scraped data

Sajad Ashouri, Arho Suominen, Arash Hajikhani, Lukas Pukelis, Torben Schubert, Serdar Türkeli, Cees Van Beers, Scott Cunningham

Research output: Contribution to journalArticleAcademicpeer-review


This article presents data on companies' innovative behavior measured at the firm-level based on web scraped firm-level data derived from medium-high and high-technology companies in the European Union and the United Kingdom. The data are retrieved from individual company websites and contains in total data on 96,921 companies. The data provide information on various aspects of innovation, most significantly the research and development orientation of the company at the company and product level, the company's collaborative activities, company's products, and use of standards. In addition to the web scraped data, the dataset aggregates a variety firm-level indicators including patenting activities. In total, the dataset includes 21 variables with unique identifiers which enables connecting to other databases such as financial data.

Original languageEnglish
Article number108246
Pages (from-to)108246
Number of pages14
JournalData in brief
Publication statusPublished - Jun 2022


  • Big data
  • Web scraped data
  • Text data
  • Innovation
  • Firm-level data

Cite this