COMING SOON: Spring 2025 Release
By Garry Klooesterman | 2025 Apr 04
4 min
Tags
java
Summary: PDFs are popular for many reasons, including being an easy way to share information. However, they pose certain challenges of their own, especially when dealing with large PDFs with many pages. For example, tasks such as manually extracting select pages or merging multiple PDFs into one file are inefficient. To handle these, and many other tasks, a PDF SDK such as the Apryse SDK can help streamline and automate the process.
Using PDFs for storing and sharing information digitally is a pretty common practice. Among other things, PDFs make sharing information easy and they display that information reliably regardless of the hardware or software used to open them.
As great as PDFs are, they come with certain challenges such as when dealing with large PDFs containing hundreds of pages when you really only need a small section or select pages, or maybe you need to merge multiple PDFs into one file. To handle these challenges, you could use a PDF editor, such as xodo.com, Xodo PDF Studio, or an app based one like Apryse WebViewer. However, a portion of the process would still be manual.
For a more automated approach, we can use the Apryse SDK to manipulate PDFs in a few ways such as splitting a PDF into multiple pages and merging multiple PDFs into one. We’ll use Java for these examples, but the SDK is also available for other languages and frameworks including C++, .NET, Python and Node.js.
Getting started is fairly easy.
Now that we’ve got everything set up, we'll look at some examples of what we can do. In this blog, we’ll look at how to:
There may be times when you need to split a PDF into multiple pages. For example, you’re splitting out a report of your employees to create a separate file for each employee.
We’ll use the following code to do just that.
// Sample 1 - Split a PDF document into multiple pages
System.out.println("_______________________________________________");
System.out.println("Sample 1 - Split a PDF document into multiple pages...");
System.out.println("Opening the input pdf...");
try (PDFDoc in_doc = new PDFDoc(input_path + "newsletter.pdf")) {
in_doc.initSecurityHandler();
int page_num = in_doc.getPageCount();
for (int i = 1; i <= page_num; ++i) {
try (PDFDoc new_doc = new PDFDoc()) {
new_doc.insertPages(0, in_doc, i, i, PDFDoc.InsertBookmarkMode.NONE, null);
String fname = "newsletter_split_page_" + i + ".pdf";
new_doc.save(output_path + fname, SDFDoc.SaveMode.REMOVE_UNUSED, null);
// output PDF new_doc
System.out.println("Done. Result saved in newsletter_split_page_" + i + ".pdf");
}
}
} catch (Exception e2) {
System.out.println(e2);
}
Now let’s look at the previous example in reverse. You have a separate PDF for each of your employees and need to merge them into one report for your manager.
Let’s use the following code to merge those PDFs.
// Sample 2 - Merge several PDF documents into one
System.out.println("_______________________________________________");
System.out.println("Sample 2 - Merge several PDF documents into one...");
try (PDFDoc new_doc = new PDFDoc()) {
new_doc.initSecurityHandler();
int page_num = 15;
for (int i = 1; i <= page_num; ++i) {
System.out.println("Opening newsletter_split_page_" + i + ".pdf");
String fname = "newsletter_split_page_" + i + ".pdf";
try (PDFDoc in_doc = new PDFDoc(output_path + fname)) {
new_doc.insertPages(i, in_doc, 1, in_doc.getPageCount(), PDFDoc.InsertBookmarkMode.NONE, null);
}
}
new_doc.save(output_path + "newsletter_merge_pages.pdf", SDFDoc.SaveMode.REMOVE_UNUSED, null);
System.out.println("Done. Result saved in newsletter_merge_pages.pdf");
} catch (Exception e2) {
System.out.println(e2);
}
Why would you ever want to do this, you ask? Well, let’s look at the reports example again. Your merged report is made up of separate sections for each employee. Each section has two pages; one for their stats and one for their goals and progress. Since you only need the pages with each of their stats, you could use the following code to delete every second page, leaving just the pages you need for reporting.
// Sample 3 - Delete every second page
System.out.println("_______________________________________________");
System.out.println("Sample 3 - Delete every second page...");
System.out.println("Opening the input pdf...");
try (PDFDoc in_doc = new PDFDoc(input_path + "newsletter.pdf")) {
in_doc.initSecurityHandler();
int page_num = in_doc.getPageCount();
while (page_num >= 1) {
PageIterator itr = in_doc.getPageIterator(page_num);
in_doc.pageRemove(itr);
page_num -= 2;
}
in_doc.save(output_path + "newsletter_page_remove.pdf", SDFDoc.SaveMode.NO_FLAGS, null);
System.out.println("Done. Result saved in newsletter_page_remove.pdf...");
} catch (Exception e2) {
System.out.println(e2);
}
See our documentation for more examples of what you can do such as removing a PDF page and cropping a page.
We’ve just covered a few of the many ways to manipulate PDFs using the Apryse PDF SDK. The code in these examples can be expanded to create more elaborate solutions to help you to manipulate PDFs, whether that is to split, add, remove, crop or reorder pages, or to rotate single or multiple pages in your PDF.
If you’re curious to see it in action, why not try it out for yourself.
Get started now or contact our sales team for any questions. You can also check out our Discord community for support and discussions.
Tags
java
Garry Klooesterman
Senior Technical Content Creator
Share this post
PRODUCTS
Platform Integrations
End User Applications
Popular Content