Linux - SecurityThis forum is for all security related questions.
Questions, tips, system compromises, firewalls, etc. are all included here.
Notices
Welcome to LinuxQuestions.org, a friendly and active Linux Community.
You are currently viewing LQ as a guest. By joining our community you will have the ability to post topics, receive our newsletter, use the advanced search, subscribe to threads and access many other special features. Registration is quick, simple and absolutely free. Join our community today!
Note that registered members see fewer ads, and ContentLink is completely disabled once you log in.
If you have any problems with the registration process or your account login, please contact us. If you need to reset your password, click here.
Having a problem logging in? Please visit this page to clear all LQ-related cookies.
Get a virtual cloud desktop with the Linux distro that you want in less than five minutes with Shells! With over 10 pre-installed distros to choose from, the worry-free installation life is here! Whether you are a digital nomad or just looking for flexibility, Shells can put your Linux machine on the device that you want to use.
Exclusive for LQ members, get up to 45% off per month. Click here for more info.
so, this isnt exactly a Linux specific Q, but i am looking for some info.
anyone know if its possible to search the cache of the bigger engines like Gool, Bingg, Yahooo. i can see how the access to the cache can be sold as a service to say the Feds, but can the public get access?
i am working an issue where some cached pages might have data which would be a security issue for my customer.
I'm not sure what you think is in, eg, Google's cache, but it may not work in the way that you think that it does.
In any case, to the extent that Google caches things that you are interested in, Google has access to that information. Now if the question could be 'Can some outsider break in to Google and get access to stuff that Google didn't intend them to?' then you'd have to say that while Google would tell you about all of their measures to make this impossible, if you found this a very serious outcome, you'd have to say that there can be no guarantee that it can never happen.
For most people there are bigger risks than this, but, if you were very sensitive about this particular issue, then you have a problem.
The one case that I can think of off hand where this kind of thing happened, it wasn't a search engine.
ok, i know how gool cache works. i can query the cache for a specific page to see what that cached paged looks like, and this is open to the public. i want to search the public cache (query it), etc. its easier to query then it is for me to build a list of URL's and then pull thise in via php and then serach using regex, etc.
customer may have leaked some data, of which they changed their html, but engine cache's may still have a copy of pages that contain this data, etc.
so now you know what the cache is. i am looking for a way to use the engine operators to find specific data that is in cached pages. we the public have access to that latest cached page, can you imagine how many copies gool has, do you see how this may be useful to say the feds or local law enforcement, you change your public Facebook stuff thinking its gone, yet gool has every change you made, etc etc. i just need to query for specific data pattern in the cache that is available to the public, etc. i am thinking i need to build a uri list, use PHP to pull those from gool cache, and then grep the page content for my pattern, etc.
Last edited by Linux_Kidd; 03-22-2013 at 03:42 PM.
LinuxQuestions.org is looking for people interested in writing
Editorials, Articles, Reviews, and more. If you'd like to contribute
content, let us know.