Diffbot is a developer of machine learning and computer vision algorithms and public APIs for extracting data from web pages / web scraping.