PDF Rendering
Any PDF file uploaded to e-PlanREVIEW (EPR) for plan review undergoes several processing steps behind the scenes so that the file, its data, and the images created from the file are available for use by review staff. This is true for any non-attachment file, regardless of whether it was uploaded directly to EPR, received through goPost™️, or sent from any other integrated partner system.
This process takes the multipage PDF document and splits it into individual pages, fixes each page by flattening them down, and then renders multiple images for each page.
Optical Character Recognition (OCR) technology then determines a sheet number for each page before all of this information is sent to the database microservice and then returned to the EPR database for storage.
Process Name | Description | Primary CLI program | Optional CLI program(s) |
---|---|---|---|
PDF Split | Splits multipage PDF into individual pages. | pdftk | qpdf |
PDF Fix | Flattens PDF page. | gs | |
PDF Render | Renders multiple images based on the PDF page: high-quality review page image (Render Review Image), thumbnail image (Render Thumbnail), and OCR capture area image (Render OCR Image). | gs | |
OCR Capture | Captures page’s sheet number. | Amazon Rekognition |
PDF Printing
Process Name | Primary CLI Program | Optional CLI Program(s) |
---|---|---|
Pdf Merge MS | pdftk | |
Printer | Hummus-Recipe (npm) |
3. HANDLING SHEET NUMBERS - PIPELINE FOR EXTRACTING SHEETNUMBER VALUES FROM PAGE