The Extract Texts From PDF action within the Workflow Media Manager Actions app by Integration Glue allows you to extract text content from a PDF file and store it in a HubSpot property. This action is useful for automating the retrieval of textual data from documents, enabling efficient data management and integration with your CRM processes.
The action automatically processes the specified PDF, extracts its text, and saves the output to a HubSpot property of your choice. This ensures the extracted data is accessible for reporting, analysis, or further automation workflows.
This action is particularly useful when:
- You need to extract and store key information from PDF documents uploaded to HubSpot records.
- You want to automate text processing tasks such as parsing invoices, contracts, or other text-heavy PDFs.
- You are looking to streamline document-based workflows by eliminating manual data extraction.
- URL of the first PDF file refers to the link (URL) to the PDF file from which you want to extract text. The PDF file must be hosted and accessible via a URL, which you can provide here. The file could be hosted in HubSpot, on a server, or any online location where the PDF resides.
- Property to save the PDF in specifies the HubSpot property where you want to store the extracted text. You’ll need to choose a custom or default property on the HubSpot record (e.g., Contact, Company, Deal) where the text from the PDF will be saved. This property will store the raw text extracted from the PDF, allowing you to view, analyze, or use it in further automation within HubSpot.
Here's a step-by-step guide to setting up the Extract Texts From PDF action:
Step 1: Add the Action to Your Workflow
- Go to your HubSpot workflow.
- Choose the Extract Texts From PDF action from the available actions list under the Workflow Media Manager Actions app.
Step 2: Configure the URL of the First PDF File
-
URL of the first PDF file:
- In this field, enter the direct URL to the PDF file from which you want to extract text.
- If the PDF is hosted on HubSpot, use the URL of the file stored within your HubSpot account or any external URL where the PDF is accessible.
Example:
https://yourcompany.com/path/to/yourfile.pdf
Step 3: Configure the Property to Save the PDF Text
- Property to save the PDF in:
- Select the HubSpot property where the extracted text will be stored. This could be a custom text property that you have set up in HubSpot or an existing default property.
- If you're unsure, create a custom property (e.g., "Extracted Text") where the extracted text from the PDF will be saved.
Step 4: Save the Workflow and Test
- Once the action is configured, save your workflow.
- Run tests to ensure the PDF text is extracted correctly and saved into the selected HubSpot property.
Step 5: Monitor the Results
- After running the workflow, check the relevant HubSpot record to confirm that the text from the PDF is stored in the selected property.
- Ensure that the extracted content appears as expected.