In AnnoWiki I implemented a function (accessed by the name LikePages) but instead of comparing the names of the pages (i.e. a NameSpace neighbor), I look for pages that link to the same ones that the current one does (i.e. ontologic cousins). The ones that link to the most like this one are likely to be most like the current.
Perhaps both of these benefit the "new reader" of the wiki during a discovery period. I can't think of many reasons established readers and authors would use it though.
Currently, this function is done by comparing forward links from pages, and declaring like pages to be the ones with the highest number of common links (PagesLikeThis).
The second variation: a list of pages that have the current one as a high like page(PagesThisIsLike?).
The third variation: the list of pages with the highest number of common back links(PagesMentionedWithThisOne?).
Now, I don't have much data to use...I don't know how well this works in practice. Has anybody else tried this? Is this worth pursuing? --DaveParker
If you need some raw link data, you can extract it from UseMod wikis such as Meatball in HTML form . Perhaps simpler to parse is the plain text format available from OddMuse wikis such as EmacsWiki . -- AlexSchroeder
Prelim results with Meatball link data:
To get these resuls, I had to remove InterMapRejections, as it links to so may things.
You may want to elide CategoryHomePage from the discussion, although I always suspected that Chris was Bayle's SockPuppet. The bastard(s). ;) -- SunirShah
I can put this online some of this weekend (Feb 21/22) is there is interest. --DaveParker
Interested. :) I'm curious whether "pages like this" or the inverse "pages this is like" is the more useful metric. -- anon.
Yes, you could also try using back links instead of forward links (or vice versa). -- SunirShah
No, I meant something different from Sunir. I can't easily parse your "not unlike" proposal; it sounds like it reduces to your original proposal. Here's what we've got so far:
#1 is your original; #2 is mine (and is identical with #1 if your relation is symmetric, though it doesn't sound like it is); #3 is Sunir's. "not unlike" reduces to either #1 or #2, depending on how you read it. -- anon.
The output of "pages this is like" would be something like:
For this page, the PagesThisIsLike? results: