We’d like to remind Forumites to please avoid political debate on the Forum.
This is to keep it a safe and useful space for MoneySaving discussions. Threads that are – or become – political in nature may be removed in line with the Forum’s rules. Thank you for your understanding.
📨 Have you signed up to the Forum's new Email Digest yet? Get a selection of trending threads sent straight to your inbox daily, weekly or monthly!
The Forum now has a brand new text editor, adding a bunch of handy features to use when creating posts. Read more in our how-to guide
Want to become a Forum Ambassador? Visit the Community Noticeboard for details on how to apply
Trying to understand whats going on here?
rowan222
Posts: 56 Forumite
This concerns a private members society here in the UK. To avoid any copy-rite issues or any forum rules regarding posting content I'm anonymising some of the details. I'm just trying to understand why/how this is happening on a technical level.
This refers to a private members society concerned with a niche area of history. An annual membership is charged and a quarterly journal is published in hard copy and posted to members. A digital archive of all past journal up to present is available to purchase by members only. No copies of the journal are available on their website. I am not a member although interested in the subject area and have considered joining in the past.
I'll refer to the society as "Society" and the journal name as "Journal"
So I was googling around to see if there were any "Journal" copies available online to have a browse. I found none. Continuing with more advanced search techniques I then found a couple of old "Journal" copies as a PDF that I could download.
The web address seemed unusual and was in the format of:
(again anonymised and journal(s) is the exact name of said publication)
I started playing around with the web address to see were more. Just entering:
resulted in a "403 Forbidden"
However I quickly found that simply changing the last part of the web address to another number e.g.: /journal165.pdf
resulted in revealing another downloadable PDF. In fact I can enter any journal number and get a PDF that can be downloaded. In fact any journal published in their 50 year history is available as a PDF download.I'm familiar with a couple of online archive sites that host a lot of articles some of which I'm sure are unauthorised but this seems very different. It almost seems like someones made a big mistake here. I'm positive "The Society" has not authorised this.
I'd just like to understand what exactly is going on. Any thoughts?
-1
Comments
-
Someone has tried to set up Dynamic DNS for a site and got it wrong. Likely that the archive is stored on a local server rather than an IP's site.
Instead of stealing their stuff why not just contact The Society and let them know?This refers to a private members society concerned with a niche area of history.Is it the Nazis?
1 -
flaneurs_lobster said:
Instead of stealing their stuff why not just contact The Society and let them know?This refers to a private members society concerned with a niche area of history.Is it the Nazis?
I'm not "stealing their stuff"! I just wanted to find a couple of sample copies online and assess if it was something worth subscribing to. I was not looking for or expecting to find the complete archive. That was a bit of a shock. I'll be informing them as its obviouslyr a security breach.No nothing to do with the Nazi's, much earlier than that!
0 -
. I was not looking for or expecting to find the complete archive. That was a bit of a shock. I'll be informing them as its obviouslyr a security breach.Its not necessarily a "security breach" if you found it in a Google search..I've found all kinds of old school or society newsletters (typically where Fred Bloggs has been mentioned because he was student of the year or scored the most goals in a football match or whatever) and something in those newsletters happened to contain whatever I'd been searching for. When you go back to the home page none of those are linked because they all published like 5/10 years ago. What it means is it was an active link at some point and Google's found them, but the institute in question never removed the files.If these journals have been deliberately arranged in a way that is accessible on the website (not necessarily linked from anywhere) then that doesn't make it a security breach either.0
-
The likely chain is simple. Someone exposed the PDFs on a publicly reachable HTTPS service on port 2443. At least one direct link leaked somewhere, possibly years ago. Googlebot fetched it, saw a valid PDF, and indexed it. The lack of access control allowed further files to be fetched once their names were known/guessed.
How to prevent this depends on the web-server hosting the, ahem, 'journals'.0 -
Vitor said:The likely chain is simple. Someone exposed the PDFs on a publicly reachable HTTPS service on port 2443. At least one direct link leaked somewhere, possibly years ago. Googlebot fetched it, saw a valid PDF, and indexed it. The lack of access control allowed further files to be fetched once their names were known/guessed.
How to prevent this depends on the web-server hosting the, ahem, 'journals'.That sort of makes sense now. It would seem that the server hosting these did not intend them to public for two reasons1. Theres no direct link to find them, i.e. it's not easy unless you get curious like I did.2. The server in the format https://xxxx-yyyyyyy.ddns.net: is not the Societies website even remotely.When I did the google search for "The Society" and "Journal I got hits in the form of:"No IP" followed by references to a particular journal number. When I clicked on that it loaded that journal as a PDF that could be saved. But only that particular copy.
0 -
Deep linking into a website, in the way you have done it, can be a crime under the Computer Misuse Act. That's especially in the way you have done it, by guessing links that are not published by the website owners.
Unless you are 100% sure the owners of the website will welcome your telling them there's a security flaw, rather than reporting you to the police, it might be prudent simply not to tell them.No reliance should be placed on the above! Absolutely none, do you hear?1 -
Just a quick google:
'If a website uses predictable, human-readable URL structures (e.g.,
example.com/report-Q1,example.com/report-Q2), guessing the next in the sequence is often considered reasonable navigation'0 -
Isn't that how the Budget got leaked early this year?
1 -
Indeed. I’m not saying that the law isn’t daft. :)
No reliance should be placed on the above! Absolutely none, do you hear?0
Confirm your email address to Create Threads and Reply
Categories
- All Categories
- 353.9K Banking & Borrowing
- 254.3K Reduce Debt & Boost Income
- 455.2K Spending & Discounts
- 246.9K Work, Benefits & Business
- 603.5K Mortgages, Homes & Bills
- 178.3K Life & Family
- 261K Travel & Transport
- 1.5M Hobbies & Leisure
- 16.1K Discuss & Feedback
- 37.7K Read-Only Boards

