Relating content automatically in Plone

Comments

Great idea, as I would expect from you :) What version of Plone would you like to target? I have interest in this for both 2.5 and 3, so I'd be game for either. I do not think the automatically generated keywords should go into the regular keywords list -- I'd prefer they get their own field. Additionally, I'd like to have the ability to run some functions on them before they're stored -- say, to compare them to my controlled vocabulary and have only the terms that match it get added. Do you think that adds too much overhead, though? Could the Zope component architecture help us here? Also, on the match side of things, would you just use a smart folder, or would you use something more sophisticated?
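
A minimal sketch of the filtering step described above, assuming the extracted terms and the controlled vocabulary are both plain lists of strings. The function name `filter_terms` and the case-insensitive matching are illustrative assumptions, not part of Plone or of any term extraction service:

```python
def filter_terms(extracted_terms, controlled_vocabulary):
    """Keep only the extracted terms that match the controlled vocabulary.

    Matching ignores case and surrounding whitespace (an assumption).
    The vocabulary's own spelling is what gets stored, and each term
    is stored at most once, in order of first appearance.
    """
    # Map a normalised key to the vocabulary's canonical spelling.
    allowed = {term.strip().lower(): term for term in controlled_vocabulary}
    kept = []
    seen = set()
    for term in extracted_terms:
        key = term.strip().lower()
        if key in allowed and key not in seen:
            seen.add(key)
            kept.append(allowed[key])
    return kept
```

The cost is one dictionary lookup per extracted term, so the overhead at save time should be small next to the term extraction itself.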

I think at least we'll need a workflow script that can be attached to any transition event (will work in 2.5 via jensens's DCWorkflow patches in AGX) and a portlet for showing matches. We'll also need to make sure our field is indexed properly, and maybe make a script to run all existing content in small batches per day. I look forward to working with you on this.
Matt, I think we'd better focus on two things: the retrieval of the keywords and the algorithm to create the relations among content. If we can solve that, then we can also have different implementations for different Plone versions or needs.

For example, I imagine using this in a special field named after some NITF element with an adapter; that's probably not your idea. You want to filter results against a controlled vocabulary, but that's not my case.

I think different taxonomies can coexist on a site. I mean, you can have social bookmarking, controlled vocabularies and term extraction over the same content and use it for different purposes.
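
As one possible illustration of the second point, relating content once each item has keywords: the sketch below scores items by keyword overlap (Jaccard similarity). This is a stand-in technique chosen for the example, not the algorithm proposed in the article, and `related_items`, `keywords_by_id` and the default threshold are all hypothetical names and values:

```python
def jaccard(a, b):
    """Jaccard similarity of two keyword collections (0.0 if either is empty)."""
    a, b = set(a), set(b)
    if not a or not b:
        return 0.0
    return len(a & b) / len(a | b)


def related_items(target_id, keywords_by_id, threshold=0.2):
    """Return ids of items whose keywords overlap the target's, best match first.

    keywords_by_id maps an item id to its list of keywords (hypothetical
    structure; in a real site this would come from the catalog index).
    """
    target = keywords_by_id[target_id]
    scored = []
    for item_id, keywords in keywords_by_id.items():
        if item_id == target_id:
            continue
        score = jaccard(target, keywords)
        if score >= threshold:
            scored.append((score, item_id))
    # Highest score first; ties broken by id for a stable order.
    scored.sort(key=lambda pair: (-pair[0], pair[1]))
    return [item_id for _score, item_id in scored]
```

Because the function only needs a mapping of ids to keywords, the same code could be fed from a controlled vocabulary, from user tags or from extracted terms, each kept in its own field.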

Let's talk about this on IRC.
OK so what you're saying basically is that there IS no solution in Plone. That's pretty pathetic for what is supposedly the biggest and best CMS. No algorithms to connect content.

For all the hype of the Plone community about their product, the examples in plone.net are not impressive at all. The whole top-down approach, where an admin gives rights to users and groups instead of it coming out of the community... still no social networks, is shocking.

But not even having good algorithms to connect CONTENT, the core of Plone, is pretty sad.

Very disappointed in all the hype and no delivery. Internet in 2009/2010 requires a hell of a lot more than a portal admin and a bunch of documents thrown on the web.
Erik,

"Internet in 2009/2010 requires...."

You might want to check the date of this article ;) you are 18 months behind...

Besides, what the author is saying is that there *IS* a solution, and all the required parts to do this are built in to Plone, apart from using an external web service to do the actual summarising. Which, if you ask me, is very web 2.0 ;)

In the 18 months since this article was written, another Python text summariser has appeared which doesn't require Yahoo's web service, if you wanted to use that:

http://pypi.python.org/pypi/topia.termextract

-Matt
Erik,

Please stop trolling. You clearly have not done very much research about Plone or its capabilities.

(And for the record, there exist several ways to relate content, both based on keywords and based on explicit relationships, in stock Plone. This article is about automatic text summaries based on linguistic analysis. That's not something most CMSs do, or at least do well.)

Martin
It seems that Yahoo's Term Extraction service doesn't support Russian =(
