I tend to be in the camp of "scan everything with an electron microscope and simulate lasers", but honestly there is a point where a decision has to be made.

Now, it's important to be _sure_ there isn't any useful data, even just silly intentional data, in that space… but from what I've learnt reading around, it would seem that its genuinely "useless" and unused. As long as the beginning and end of these sectors are well known are documented, I see no harm with the current method which ignores those bits.

2

(7 replies, posted in Guests & account requests)

Well, keep in mind one thing… The Redump ID is merely a database convenience. The XML files you are looking at (The DAT files) are NOT complete data dumps. most of the data is in fact contained on the site's DB, whereas the .dat file is a special-purpose format for file management software. To have a complete picture of the data, you need at least both sets.

moreover, keep in mind Redump is a media preservation project. Each single ID refers to a single piece of physical media the group has documented (i.e. dumped, verified, and collected metadata for). This means that an ID won't be uniquely associated to a video game, but to a specific disc of a specific edition of said game.

It would be great to meet in chat over on discord as creating a broader KG of videogames is something I think is very intersting, but perhaps needs a little more pre-work than simply starting from the Redump IDs.

3

(7 replies, posted in Guests & account requests)

Wikidata is essentially a semantic web project, it's a site "similar" in idea to wikipedia (human collaborative curation) but aiming to build a structured knowledge graph.

I don't know what OP is looking to do, but a quick look at the page it seems like he would like to replicate Redump DB's info (some of it, at least) on the KG.


What level of data are you planning to place into the graph?

OS tagging can become quite a nightmare though. is it enough to say "DOS", "Win16", "win32" etc and make assumptions about possible compatibility? or do we copy what the disc says "Win95-98" etc?
It's a dimention that would be great to capture, of course, but if required upfront mike make dumping just that much more cumbersome.

Hi,

I've noticed recently after adding http://redump.org/disc/41446/ that the title was changed, capitalizing nouns.
This is customary in english, however in Italian it is normal to only capitalize the first word, and proper nouns (i.e. names etc)
I usually try to follow the original capitalization on the packaging when possible, but as a rule of thumb IMHO this capitalization shouldn't be artificially injected.

actually, I have a similar doubt. a while back I bought some discount game editions, one of which was a replacement of my dead RollerCoaster Tycoon 2 game. this particular edition contains RCT2, and both its expansions, as a single install DVD. I'm not really sure what to call it, as no unique title other than "three games" is given, on top of of course the list of the three originals contained.