Apache PDFBox is an open source pure-Java library that can be used to create, render, print, split, merge, alter, verify and extract text and meta-data of PDF files.Open Hub reports over 4,000 commits (since the start as an Apache project) by 17 contributors representing more than 120,000 lines of code. PDFBox has a well established, mature codebase maintained by an average size development team with increasing Y-O-Y commits.