This assignment is part of a series of experimental programming projects.
Below is the goal of this assignment. We are asking you to achieve this goal in the best, most efficient way you can. Five programmers will be selected to complete this task.
In your bid, please include the following:
- The programming language you intend to use
- The general method you propose
GOAL:
We have a system of tagging all information by “Subject/Section” and ICON1, ICON2, ICON3, ICON4. For example: Health = section. Medicine = ICON1, Cancer = ICON2, etc. We need a system that can quickly read a text string, and then automatically discover what the likely categories it belongs to. For example, a text string might have words like “Today the basketball star from the Celtics scored big”. Section = sports, ICON1 = basketball, ICON2 = Celtics, etc.
Part of the process of doing this is to first find “any” categories, before assigning Section and ICONs. We will supply a list of “categories” from Wikipedia. The code should start with the longest string and end with the shortest string so that “gastronomy” is found before “astronomy” with a few checks to insure spacing, punctuation, etc. but also joins like “collegefootball” versus “college football” where in the former case this string might be in a URL, so it should catch it.
Once categories are established, a second routine will assign sections, etc. These will come from separate tables prepared and provided by us. This will get better and better as we see what wholes there are in the system.
Dear Sir,
I have already made solution of this type. I got tables named twitter_message(msg_id, msg_string,msg_value), which contains URL in msg_string column. I have to calculate the string weight by comparing each word with a dictionary_tab(dic_word,word_weight) table.
I made a database function(Oracle 11g R2) out the numeric value of any URL passed as string parameter. I believe I can make a solution for you.
- The programming language you intend to use ->>>> (Oracle 11g R2)
- The general method you propose(PL/Sql function, that can be called in any oracle environment)
Regards
Rana