LinuxQuestions.org
Help answer threads with 0 replies.
Home Forums Tutorials Articles Register
Go Back   LinuxQuestions.org > Forums > Linux Forums > Linux - Software
User Name
Password
Linux - Software This forum is for Software issues.
Having a problem installing a new program? Want to know which application is best for the job? Post your question in this forum.

Notices


Reply
  Search this Thread
Old 12-17-2023, 10:35 AM   #1
unstamped
LQ Newbie
 
Registered: Sep 2023
Distribution: ArchLinux
Posts: 13

Rep: Reputation: 0
Good reader for an extremely large pdf file


Hello.

I have a large PDF document that I want to be able to search for keywords. The file in question is about 400MB. It is searchable but navigating through the file is difficult; searches are taking up to 10 seconds, if not more, and simply scrolling takes a few seconds to load the page.
I don't need to open it, per say. A simple grep-like functionality will suffice for me. The file is similar to a dictionary, so i just need it to find the keyword and list the line containing it.

Is there anything that can process this optimally?

Thank you.
 
Old 12-17-2023, 10:40 AM   #2
dugan
LQ Guru
 
Registered: Nov 2003
Location: Canada
Distribution: distro hopper
Posts: 11,275

Rep: Reputation: 5342Reputation: 5342Reputation: 5342Reputation: 5342Reputation: 5342Reputation: 5342Reputation: 5342Reputation: 5342Reputation: 5342Reputation: 5342Reputation: 5342
You can try converting it to text with "pdftotext" first. Here's the Wikipedia article:

https://en.wikipedia.org/wiki/Pdftotext
 
1 members found this post helpful.
Old 12-17-2023, 10:43 AM   #3
michaelk
Moderator
 
Registered: Aug 2002
Posts: 25,837

Rep: Reputation: 5971Reputation: 5971Reputation: 5971Reputation: 5971Reputation: 5971Reputation: 5971Reputation: 5971Reputation: 5971Reputation: 5971Reputation: 5971Reputation: 5971
Try pdfgrep, works similar to grep.
 
2 members found this post helpful.
Old 12-17-2023, 11:39 AM   #4
unstamped
LQ Newbie
 
Registered: Sep 2023
Distribution: ArchLinux
Posts: 13

Original Poster
Rep: Reputation: 0
Thank you so much!
Both work well, but I just love the pdftotext utility. I love the simplicity; pure plain text . And I had it all along, had no idea it is included by default.

Last edited by unstamped; 12-17-2023 at 11:40 AM.
 
  


Reply



Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is Off
HTML code is Off



Similar Threads
Thread Thread Starter Forum Replies Last Post
[SOLVED] Firefox newcomer: can't directly send a PDF file coming throu my internet connection to disk. Instead, first a PDF reader is run. stf92 Slackware 2 08-28-2017 12:28 AM
Slim free PDF Reader as alternative to Adobe Reader cccc Debian 6 10-14-2010 02:51 PM
Editing extremely large files, too large for memory? SirTristan Linux - Newbie 2 12-22-2009 03:06 PM
IBM T42 "Extremely, EXTREMELY Slow" alwayslearning Linux - Laptop and Netbook 5 10-11-2009 03:34 AM
LXer: This week at LWN: Large pages, large blocks, and large problems LXer Syndicated Linux News 0 09-27-2007 11:40 AM

LinuxQuestions.org > Forums > Linux Forums > Linux - Software

All times are GMT -5. The time now is 04:43 AM.

Main Menu
Advertisement
My LQ
Write for LQ
LinuxQuestions.org is looking for people interested in writing Editorials, Articles, Reviews, and more. If you'd like to contribute content, let us know.
Main Menu
Syndicate
RSS1  Latest Threads
RSS1  LQ News
Twitter: @linuxquestions
Open Source Consulting | Domain Registration