Some thoughts on thesauruses

I’ve long pondered why the records world doesn’t use or understand thesauruses properly. Twenty years ago a library user searching for automobiles would be told – by the software – that the preferred term was cars, and shown the results for a search on cars. What’s more, the same functionality provided guidance through the maze of organizational history: earlier and later names, splits and mergers. A thesaurus is one of a family of terms including ontology, taxonomy and classification. But if it lacks semantic guidance, it’s not a thesaurus, it’s a file plan, or, in specialized form, just a disposal schedule. In records, we don’t even have this semantic functionality in most of our software systems.

Even thesauruses, as defined by the ISO standards, are now inadequate. Most organizations have clusters of terms which demand several equivalent synonyms. It is a distortion to say that one of these must be the preferred term, because it depends, for example, if you are in the legal, technical or public domain. A century ago, faceted classifications using index cards were common in research organizations throughout Europe, and we are only now able once again to provide that flexibility to our users, through tools such as folksonomies.

Tags:

Mass storage and the future of Records Management

Had an interesting discussion on a Listserv (remember them?) about recordkeeping and mass storage. I mentioned how I don’t select different files to be kept for different periods as the law requires (you know, financial stuff 7 years etc), I just image everything using BootitNG, burn it onto DVD’s and keep the lot until the bottom drawer fills up, then I toss the oldest ones. Currently that means I have disk images going back to the late 1990′s, containing some documents I still need to keep even if the law says I could have discarded them, and others I could have tossed years ago but can’t easily separate out from the disk images.

Hey, it works for me.

In contrast, in my work environment, I’m running a full blown records management operation, with policies, procedures, staff, training, budgets, software systems and databases, all designed to tag records according to the multitude of different disposal triggers and dates.  But what if the same principle applies – just buy lots of cheap storage and keep the lot? Let’s assume we solve the problems of changes in technology and software over time (I used to have some WordPerfect 4.1 files on 3.5 inch diskettes, with nary a diskette drive in the house for several years now) what then is the role of a records manager?

There will be a role, but it won’t be centered around disposal, which we will manage with a very broad big bucket approach. Instead of the hundreds of different categories we have today, we’ll maybe have ten, because there are few risks and little or no extra cost in keeping stuff longer.

Quite simply, we have to get away from disposal, and focus instead on helping people index and organise their documents in simple ways so they can be easily retrieved when a Google-type search doesn’t work. For too long the Records profession has let disposal distort everything else we do, and it’s the tail wagging the dog.

Hence my interest in epistemology, simple search systems (I use Copernic on my PC), disc imaging and large lumps of cheap storage.

Tags: , ,

Waxing philosophical

I’ve never formally studied philosophy, but lately it has come up in my work on electronic document management, especially file plans, classification,  taxonomy, ontology – call it what you will. I need simple ways of teaching basic concepts, so I’ve been grappling with questions like How do we know things? How do we differentiate one thing from another? And guess what – it’s called metaphysics, and epistemology.

So I’ve been reading some introductory books from the Parramatta City Library, right near where I work. Some are hard going, some incomprehensible. The Complete Idiot’s Guide to Philosophy is just so badly written! But Stephen Hetherington, Roger Scruton and the wonderfully lucid A. C. Grayling have really got me thinking. Great stuff.

Right now I’m trying to understand postmodernism, post-structuralism and deconstructionism. I know it’s hard to write simply when you are trying to explain complex ideas, but frankly most of what I’ve read so far is hard going, dense, turgid and incomprehensible. It’s a shame because there are some good ideas hidden away in there.  I’m also not sure about starting with useful linguistic tools and extrapolating so far as to question the validity of pretty well everything written or spoken. Its also a pity some mediocre intellects have used all this to wreak havoc in history faculties and secondary education literature syllabuses.

However, as I said, there appears to be some useful stuff in there. I’ll persevere a while yet.

Tags: , , , ,

Windows 7?

Right now, I’m running Windows 7 RC (release candidate) on my desktop, in a virtual PC (I use VirtualBox). Away from my day job I’m the sort-of tech support for several old, el cheapo PC’s for assorted family and friends. I’ve built most of the PC’s by cannibalising even older ones. All run Windows XP, I didn’t want anything to do with Vista. Recently one was replaced and the new machine (wish it were mine) came with Vista, which forced the issue on me. So I set up Win 7, and I’m quite impressed. No crashes so far, and all my hardware worked without needing to hunt down drivers. Under VirtualBox you can’t run the full Aero interface, but the look and feel is still there.

Two of the PC’s I care for aren’t up to running 7, so they’ll stay with XP. Whether the others change depends largely on price, and whether Microsoft brings the family pack to Australia at a reasonable price and with good availability. But there is no doubt I’ll swap most of the PC’s to Windows 7 eventually. I predict what I’ll end up supporting is a few old PC’s running XP until they get upgraded, and the remainder with assorted varieties of 7, and I’ll put Ubuntu on my desktop and run XP under VirtualBox. The latest Ubuntu is looking just as impressive as 7, so unless you are running software which only runs under Windows, it too is worth thinking about.

And if you do trial Ubuntu under VirtualBox, don’t set the video RAM to the 128 mb maximum, the login screen gets garbled. Once you log in all is well, but it’s interesting trying to log in without anything intelligible on screen. Set video RAM to 64 and all is well.

Tags: ,

BootitNG

I’ve just been partitioning a hard drive on one of our home PC’s, using Terabyte Unlimited‘s BootitNG, and it occurred to me that I’ve been using this, and their Image for Windows program, to do image backups and manage drive partitions, since an article by Fred Langa put me onto them in 2002. This stuff is sturdy! I’ve never had a problem, I have a stack of DVD’s going back to the ark, plus the last few month’s worth on assorted external and internal drives. It’s saved my hide a couple of times. It’s pretty geeky to use, but reliable as you’d ever want.

What I’m actually doing is splitting one 500g drive into two,  one for Windows and the other for photos. With all the photos in the C: drive, backups were taking far too long. Now I can back up the high-activity area easily, and do the photos every now and again.

So if you know what you are doing, BootitNG is highly recommended.

Tags:

Same web site, new look and new tools

Well, after many years doing the web site myself with a text editor, I’ve finally succumbed: a whole new look, all set up and edited in WordPress. As a result, the site will be a work in progress for a while, as I explore functionality, learn how to drive it all, and rewrite some seriously out of date content. Hold your breath as we go for a ride on this, it’s fun!

Tags: