Closed

Suggest the best way of clustering of news articles for aggregator

The project is based on Laravel/ PHP.

Here is the case, I get news from several news sources every minute. Basically they are wordpress post, as the script we are using for news aggregator is based on Wordpress Plugin.

Now, we are fetching those post to Laravel site via one of those Wordpress to Laravel([login to view URL]).

So far, we can using TextRank([login to view URL]), we can do following for any posts:

Find sentences,

Remove stopwords,

Create integer values by find and count the matching words,

Change the integer values by the related words' integer values,

Normalize values to create scores,

Order by scores

To be more precise, we can get bag of words from any wordpress Post.

Now, I am gonna need a complete algorithm and guide, preferably on PHP(if there is any library) that will be able to cluster/ group lists of articles into a same Coverage table. Coverage can have any data(as whatever you say to make algorithm good), what I think is we need coverage ID field, and a field that accepts array of post ID that is similar to each other and has same Coverage ID.

We also have a table called newsTag, that has following field: postId, most important topic mentioned. You can ignore the topic mentioned because, it depends on only the topic that is category, so if we cluster based on topic mentioned from newsTag, we will be limiting clustering ability because in some post there are no topic mentioned.

Provide me complete algorithm, based on it, ask me any questions if you need to and send me a PDF file of algorithm and possible an examples.

Skills: Algorithm, Artificial Intelligence, Machine Learning (ML)

See more: the best way to do marketing for rpl qua, the best way to find a web developer, the best way to get a website developer, the best way to make a decision for programmer, is website the best way to create publicity, the best way to find out who manufactures for a company already recognised, what is the best way to make money selling graphics, what is the best way to sell a domain name, what the best way to design a logo, which is the best way to earn money from home, write an article about the best way of learning and getting information, how can i promote my music in the best way in lebanon, india is the best way outsourcing project on power point, the best way to drive traffic to your website, the best way to find a cheap graphic designer, the best way to find a software developer, best way to learn italian in the car, best way to send photos over the internet, best way to see europe for the first time, best way to learn spanish in the car

About the Employer:
( 8 reviews ) Houston, United States

Project ID: #20076244

2 freelancers are bidding on average $250 for this job

WinterGreenTech

Hi wintergreen develops more projects in A I for last 9 years...so we have deep knowledge in research concept... as per requirement: 1. it's like nlp...remove stop words and apply pos tag. 2. Extract the features lik More

$388 USD in 15 days
(3 Reviews)
3.6
bob12workaholic

I can help you with this project. Writing algorithm in PHP can be painful, workaround would be I can give you a flask based REST API (python) and I can open the endpoints for you to POST coverage_id or <array_of_covera More

$111 USD in 2 days
(0 Reviews)
0.0