II want to discuss the automation of PDFs, or known as Portable Document Format. PDFs are an extremely popular file format for sharing information across businesses of all sizes. We encounter PDFs in contracts, reports, legal documents, press releases, and more. PDFs can be static or dynamic, depending on their use. Given their widespread usage, it’s essential to have a focused strategy for testing them. In some domains, such as banking, the number of PDFs to be verified can be extensive, often reaching into the hundreds or more. This is where automation becomes crucial. However, testing PDFs presents a unique challenge: unlike web pages, PDFs don’t have locators, making them difficult to test.
In this session, we will address this problem. We’ll outline a comprehensive testing strategy based on the test pyramid, explore available open-source tools, discuss their advantages and limitations, and go through demo code to understand the implementation. By the end of this session, you will have a complete test automation strategy for PDFs.
Talk Takeaways
I am a Lead Quality Analyst at ThoughtWorks with 18 years of experience in the industry.
Prior to Thoughtworks I have been part of product organisations like Sun Microsystems and Intuit. In this journey I am fortunate to drive quality for some of the applications which are widely used across the globe and enterprise organisation.