Improvement of blog statistics program
To take on this project you must have deep knowledge of PHP and MySQL (including MATCH full-text search), and a deep understanding of HTML and the other code found on web sites, specifically blogs.
Background and problem to solve
We have a huge database table (bd_blog.blog_data) with content from blogs. The table is updated by our web crawler.
We create marketing campaigns and for each campaign we have a list of mandatory keywords.
The blog statistics program searches blog_data for keywords in two different ways:
1. All content from the blogs is searched for the keywords (by MySQL). Every blog found is inserted or updated as a record in buzzador_blog_statistics.
2. The body of the blog content is searched for the keywords in another way. Every blog found gets its record in buzzador_blog_statistics updated with web_status1=OK (otherwise web_status1=Not_ok).
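One possible reason the body search in 2. misses blogs (an assumption, not something confirmed by the program itself) is that keywords in the body are broken up by HTML tags, entities or irregular whitespace. The following is a minimal Python sketch of normalizing a body before matching. The names normalize_body and keyword_in_body and the list of inline tags are hypothetical, and the real program is written in PHP, so this only illustrates the idea:

```python
import html
import re

# Hypothetical list of inline tags that may sit in the middle of a word.
INLINE_TAGS = {"a", "b", "i", "em", "strong", "span", "u", "font"}

SCRIPT_STYLE_RE = re.compile(r"<(script|style)\b.*?</\1\s*>", re.IGNORECASE | re.DOTALL)
COMMENT_RE = re.compile(r"<!--.*?-->", re.DOTALL)
TAG_RE = re.compile(r"</?([a-zA-Z][a-zA-Z0-9]*)\b[^>]*>")
WHITESPACE_RE = re.compile(r"\s+")


def _replace_tag(match):
    # Inline tags vanish so "Buzz<b>ador</b>" becomes "Buzzador";
    # all other tags become a space so adjacent paragraphs do not merge.
    return "" if match.group(1).lower() in INLINE_TAGS else " "


def normalize_body(raw_html):
    """Reduce an HTML body to lowercase plain text with single spaces."""
    text = SCRIPT_STYLE_RE.sub(" ", raw_html)  # drop script/style content
    text = COMMENT_RE.sub(" ", text)           # drop HTML comments
    text = TAG_RE.sub(_replace_tag, text)      # drop the remaining tags
    text = html.unescape(text)                 # decode &nbsp;, &amp;, ...
    text = WHITESPACE_RE.sub(" ", text.casefold())
    return text.strip()


def keyword_in_body(keyword, raw_html):
    """True if the keyword occurs as a whole word or phrase in the body."""
    needle = WHITESPACE_RE.sub(" ", keyword.casefold()).strip()
    pattern = r"(?<!\w)" + re.escape(needle) + r"(?!\w)"
    return re.search(pattern, normalize_body(raw_html)) is not None
```

Tags are stripped before entities are decoded, so text such as &lt;b&gt; that the blog author wrote literally is not mistaken for markup. Whether whole-word matching is the right rule for the campaign keywords is a decision for the actual program.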
We have made a manual verification and have a list of 100 examples where web_status1=Not_ok. About 40 of them should have web_status1=OK.
Conclusion:
Search 1 above finds too many blogs matching the campaign.
Search 2 above does not find all relevant blogs.
What we need is to find all relevant blogs and to skip the irrelevant ones from the beginning.
Nothing in 1. shall be changed until the code in 2. has been improved so that all relevant blogs are found.
When that is solved (but not before!) we can change the logic in 1. above so that Not_ok records are never stored in the database table.
Input to the program is
keywords per campaign from the campaigns table: [login to view URL]
a very big table with content from blogs: bd_blog.blog_data
Output from the program is
data in the following tables:
buzzador_blog_statistics
buzzador_blog_statistics_pages
buzzador_blog_statistic_keywords
and an update in the campaigns table with the date_time of the last update
All information in buzzador_blog_statistics, buzzador_blog_statistics_pages and buzzador_blog_statistic_keywords is inserted and updated by the [login to view URL] program.
Development environment:
You will get a copy of the program.
I will put copies of the relevant tables in the same database as blog_data and give you the information you need to access the database remotely.
Type of task
You do not have to change many lines of code in the program, but you need to put a lot of effort into understanding why the program does not find the keywords for some blogs. The program is 886 lines long, but only a minor part must be changed to solve the problems.
Milestones
First milestone = 1/3 of total offer:
Will be set up from the beginning and released when you can prove that you find all relevant blogs according to the list from the manual verification, plus those which already have web_status1=OK.
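A proof of this kind could take the form of a small check that runs the improved matcher over the manually verified list and reports every blog it still misses. The Python sketch below assumes a hypothetical layout for the verified examples (VerifiedExample) and uses a deliberately naive stand-in matcher, naive_matcher, which assumes every keyword must be present. Neither reflects the actual PHP program or table layout:

```python
from dataclasses import dataclass
from typing import Callable, Dict, List, Sequence


@dataclass(frozen=True)
class VerifiedExample:
    """One row of the manually verified list (hypothetical layout)."""
    blog_id: int
    body_html: str
    keywords: Sequence[str]
    expected_ok: bool  # True if manual verification says web_status1 should be OK


Matcher = Callable[[str, Sequence[str]], bool]


def check_examples(examples: Sequence[VerifiedExample], matcher: Matcher) -> Dict[str, List[int]]:
    """Compare matcher results with the manual verdicts.

    "missed" lists blogs that should be OK but were not matched;
    "wrongly_ok" lists blogs that were matched but should not be.
    """
    report: Dict[str, List[int]] = {"missed": [], "wrongly_ok": []}
    for example in examples:
        got_ok = matcher(example.body_html, example.keywords)
        if example.expected_ok and not got_ok:
            report["missed"].append(example.blog_id)
        elif got_ok and not example.expected_ok:
            report["wrongly_ok"].append(example.blog_id)
    return report


def naive_matcher(body_html: str, keywords: Sequence[str]) -> bool:
    """Stand-in matcher: every keyword must appear as a raw substring."""
    lowered = body_html.lower()
    return all(keyword.lower() in lowered for keyword in keywords)
```

Under these assumptions the first milestone would be met when "missed" is empty for the verified list together with the records that already have web_status1=OK.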
Second milestone = 1/3 of total offer:
Will be set up when the first milestone is released, and released on delivery of a working program that solves both 1. and 2. without breaking anything else in the program.
Third milestone = 1/3 of total offer:
Will be set up and released only if delivery follows the offered time schedule. Time is measured from when you have received the files and the database access information.
More information before final acceptance
I will give you the chance to see the blog_statistics program before final acceptance.
Guaranteed lead time is important when evaluating bids, but note the third milestone!