Support for indexing and searching eBooks?


Postby csalsa » Mon Jan 02, 2012 6:19 pm

Hi

I have a technical library of books. Sooner or later I will stop buying paper versions and get electronic books or digital books. However, one of the things I want is to be able to search across eBooks.

X1 does not support .azw/.tpz (Kindle 'Amazon'), .dnl (DNL Reader), .epub (IDPF/EPUB), .lit (Microsoft Reader), .prc/.mobi (Mobipocket), .opf (Open eBook), .BBeB (Broadband eBook 'Sony'), .pdb (eReader 'Barnes & Noble') etc.

But it does support .pdf.

Is this because some of the formats are proprietary? Are IFilters not available for text extraction from these formats? Are Stellent viewers not available for viewing them? Or does DRM prevent text extraction from the eBook for indexing?

With the growth of eBooks, I would think support for eBooks is an important topic for X1. Support for .epub, .mobi and .azw would be great.

Any thoughts? Any comments?
csalsa
X1 Super User
 
Posts: 180
Joined: Wed Jun 03, 2009 5:49 pm

Re: Support for indexing and searching eBooks?

Postby Kenward » Tue Jan 03, 2012 3:46 am

This is a question that I have also pondered.

At the moment, I deliberately avoid getting material in formats that X1 cannot handle. But that is going to become harder.

I already have subscriptions to journals (Science, for example, and some IEEE publications) that I can also get as Zinio (.zno) files, which are effectively protected PDF files. I have never been a fan of that format, partly because of the accessibility problem.

They all seem to demand their own viewers. If X1 can connect to installed viewers and iFilters, fine. I will install the software. But I do not want to have to individually index and search every format.

I sometimes print to PDF for odd formats, but this is not really a viable long-term option.
MK
X1 Search 8.6.1 - Build 6003fa (64-bit)
Windows 10 Pro 64-bit | Windows 10 Home 32-bit
No, I have nothing to do with X1, just a user since 2004.
Kenward
X1 Guru
 
Posts: 4149
Joined: Tue Apr 20, 2004 2:35 am
Location: UK

Re: Support for indexing and searching eBooks?

Postby WebDev » Tue Feb 25, 2014 11:05 am

Couldn't agree more about X1 supporting multiple e-reader formats. I'm currently wrestling with that issue over in the Amazon Kindle support forum.

Please take note, X1. When I recently found that your 8.1.1 product dramatically trounced Copernic Desktop Professional in every way, I dropped them like a hot potato after many years of use. It was actually pretty easy given their 4.x product: tons of bugs, a user interface oversimplified from previous versions, much slower indexing, and totally sketchy and inconsistent recursive indexing of deep folders.

That said, I hope you won't rest on your laurels and will keep adding new formats that you can search. E-book formats are one glaring omission. If it is a DRM issue, then if I were you, I'd have a major initiative to negotiate some kind of ifilter solution with them that avoids DRM violation. Just my humble opinion, but I'm certain that I'm representative of a large percentage of your users. Knowledge workers want quick access to ALL of the information on their hard drives, no matter the format.

http://www.amazon.com/forum/kindle/ref= ... &cdUnsub=1
WebDev
 
Posts: 1
Joined: Tue Feb 25, 2014 10:53 am

Re: Support for indexing and searching eBooks?

Postby visionweaver » Sun Apr 06, 2014 12:16 pm

I, too, would like to see indexing of e-books added. I use X1 for indexing academic articles and books, and more and more material is now available as e-books. (For now, I convert them to PDFs to index them, but it would be nice to skip that step!)
visionweaver
 
Posts: 1
Joined: Sun Apr 06, 2014 12:14 pm

Re: Support for indexing and searching eBooks?

Postby golfingcat » Sat Oct 18, 2014 4:52 am

Hi, I discovered your app after Copernic. Non-DRM EPUB search should be a must for your great software.
golfingcat
 
Posts: 2
Joined: Sat Oct 18, 2014 4:49 am

Re: Support for indexing and searching eBooks?

Postby Kenward » Sat Oct 18, 2014 7:45 am

epub is so obvious that you have to wonder why it hasn't happened.

Is it a proprietary format? Is Oracle, owner of the technology that most of the computing world now uses to view files, so far behind the curve that it hasn't come up with viewers for this format?
Kenward
X1 Guru
 
Posts: 4149
Joined: Tue Apr 20, 2004 2:35 am
Location: UK

Re: Support for indexing and searching eBooks?

Postby Utomo » Wed Aug 10, 2016 3:30 am

+1 for ebooks support

At least support the most common formats:
.epub
.mobi
.azw
Utomo
 
Posts: 12
Joined: Wed Jul 20, 2016 9:28 pm

Re: Support for indexing and searching eBooks?

Postby tm19 » Mon Jan 30, 2017 3:55 am

Ditto the OP's request for non-DRM ePub indexing.

Ditto WebDev's comments about Copernic and the temptation to jump back now that it supports ePub files.
tm19
 
Posts: 5
Joined: Mon Oct 12, 2015 4:43 pm

Re: Support for indexing and searching eBooks?

Postby wd210 » Mon Jun 01, 2020 11:10 am

This thread is 8+ years old, and there's still no search for .epub files? Is it that difficult to add this capability? The format has only gotten more widespread since the original post.
wd210
 
Posts: 4
Joined: Tue Aug 09, 2011 6:31 am

Re: Support for indexing and searching eBooks?

Postby tm19 » Tue Jun 02, 2020 7:17 am

It's not an elegant solution but I've manually added EPUB and MOBI to the Global Whitelist Settings.

For EPUB: I can find the item by filename and open it via the Calibre eBook viewer. X1 will also index the contents but only return the embedded HTML file. From here I can either view it in the preview window, open it up with my default viewer, or track down the parent EPUB and go from there.

For MOBI: X1 does not seem to index the contents, and so it will only find by filename. However, I generally convert MOBI to EPUB using Calibre. This speeds up opening in Calibre and makes the contents searchable in X1.

Of course, all of this applies only to non-DRM files.
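
For anyone wondering why the embedded HTML file is what comes back: a non-DRM EPUB is a ZIP container holding (X)HTML content files, so the text is reachable with ordinary tools. The following is only a rough sketch of that idea in Python using the standard library. It is not how X1 works internally, and the names `epub_text` and `search_epub` are made up for illustration. It also assumes the content files are UTF-8 and end in .xhtml, .html, or .htm.

```python
import zipfile
from html.parser import HTMLParser


class _TextCollector(HTMLParser):
    """Collects visible text, skipping the bodies of <script> and <style>."""

    def __init__(self):
        super().__init__()
        self.chunks = []
        self._skip_depth = 0

    def handle_starttag(self, tag, attrs):
        if tag in ("script", "style"):
            self._skip_depth += 1

    def handle_endtag(self, tag):
        if tag in ("script", "style") and self._skip_depth:
            self._skip_depth -= 1

    def handle_data(self, data):
        if not self._skip_depth and data.strip():
            self.chunks.append(data.strip())


def epub_text(path_or_file):
    """Hypothetical helper: map each embedded (X)HTML file name to its plain text."""
    texts = {}
    with zipfile.ZipFile(path_or_file) as book:
        for name in book.namelist():
            # Assumption: content documents use one of these extensions.
            if name.lower().endswith((".xhtml", ".html", ".htm")):
                parser = _TextCollector()
                parser.feed(book.read(name).decode("utf-8", errors="replace"))
                parser.close()
                texts[name] = " ".join(parser.chunks)
    return texts


def search_epub(path_or_file, term):
    """Hypothetical helper: names of embedded files containing term (case-insensitive)."""
    needle = term.lower()
    return [
        name
        for name, text in epub_text(path_or_file).items()
        if needle in text.lower()
    ]
```

Calling `search_epub("book.epub", "some phrase")` (the file name is a placeholder) would list the embedded chapter files that contain the phrase, which is roughly the hit you then have to trace back to the parent EPUB. This will not help with DRM-protected books, since their content files are encrypted.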

I still don't know why X1 can't add this feature. By the way, I have stuck with X1 for one simple reason: although Copernic Desktop Search has built-in support for ePub files, they insist on using the terrible, terrible annual subscription model. I have similar hangups with Adobe, Office 365, and commercial antivirus programs. Yeah, I know, I got issues.
tm19
 
Posts: 5
Joined: Mon Oct 12, 2015 4:43 pm

