I've been trying to collate 'The thoughts of captain paranoia' again, prompted by this recent DofE malarkey.
What I wanted to do was find a tool to examine the saved 'print threads', and automatically find keywords, and produce lists of threads that contain these words. But, despite the fact that we have played with information mining things like this a bit (not personally), I couldn't find a freeware tool to do this.
So I've been playing with the Unix 'productivity tools', such as tr, grep, awk & sort, and have now analysed all my contributions to the threads I've posted on. This has produced a word frequency table, which I find quite interesting.
As is to be expected, the most frequent are to be found in the usual list of most common English words, but it's not long before interest-specific words start to appear. e.g. the first such is at #33, being 'water'. Then more appear:
<not consecutive>
jacket
fabric
layer
fleece
shell
waterproof
fuel
base
windproof
top
bag
gps
montane
air
lightweight
meths
pertex
<then little clusters start to appear>
gear
small
cheap
wear
warm
pan
walking
right
product
idea
pound
insulation
gas
paramo
ml
experience
fabrics
clothing
map
help
climbing
I find it interesting that this actually says quite a lot about my postings, and the things I post about...
And yes, I know that I need to get out more...