Task Name:
Task Description:
UPDATE #]3250 [PG view/article TAB news] DEDUPLICATION and LIMITATION - limiting the number of results returned and preventing duplicate articles [REPLACING] none [WHY] prevent duplication on the "consolidated" list and to control the total amount of,PUBL
Content:
<h2>[-previously-]</h2>
<ol>
<li><strong>] <a href="http://sospep.com/view/task?id=2827" target="_blank">feature idea 3250</a></strong>
<ol> <li>x] overview article describing what the 'news tab' feature is and what it does; it briefly outlines how we plan to implement it.</li> </ol>
</li>
</ol>
<h2>[-currently-]</h2>
<ol>
<li><strong>x] this was easy: we just modified our YQL query to return only the "desired QTY" using a LIMIT clause</strong>
<ol> <li>x] YQL basically did the rest</li> </ol>
</li>
<li><strong>x] added a unique() filter on the title field to prevent duplicate articles</strong>
<ol> <li>x] see notes 1</li> </ol>
</li>
</ol>
<h2>[-next-]</h2>
<ol>
<li><strong>x] dedup - needs to be tested</strong>
<ol> <li>] didn't see any duplicate results in the initial result set</li> </ol>
</li>
<li><strong>x] count limit works</strong>
<ol> <li>(we're getting max 100 results BUT don't know how it works, i.e. if url 1 has 100 results, does it stop there or ???)</li> </ol>
</li>
<li>x] QRY - select * from rss where url IN ("<a href="http://news.cnet.com/8300-10805_3-75.xml">http://news.cnet.com/8300-10805_3-75.xml</a>","<a href="http://rss.news.yahoo.com/rss/tech">http://rss.news.yahoo.com/rss/tech</a>","<a href="http://news.google.com/news?pz=1&amp;cf=all&amp;ned=us&amp;hl=en&amp;topic=tc&amp;output=rss">http://news.google.com/news?pz=1&amp;cf=all&amp;ned=us&amp;hl=en&amp;topic=tc&amp;output=rss</a>") LIMIT 10 | unique(field="title")</li>
</ol>
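The open question about how the limit interacts with dedup can be explored offline with a small sketch. It is not YQL and makes assumptions that these notes do not confirm: that the feeds are concatenated in the order listed, that the limit is applied to the combined list before the piped unique step, and that the first occurrence of a title is the one kept. The names `limit`, `unique_by`, `feed_a` and `feed_b` are hypothetical and the articles are made-up sample data.

```python
from typing import Dict, Iterable, List

Article = Dict[str, str]


def limit(items: Iterable[Article], n: int) -> List[Article]:
    """Keep only the first n items (stand-in for a LIMIT clause)."""
    return list(items)[:n]


def unique_by(items: Iterable[Article], field: str) -> List[Article]:
    """Drop items whose `field` value was already seen, keeping the first one."""
    seen = set()
    result = []
    for item in items:
        key = item.get(field)
        if key in seen:
            continue
        seen.add(key)
        result.append(item)
    return result


# Made-up sample feeds; "Shared" appears in both.
feed_a = [{"title": "A1"}, {"title": "Shared"}, {"title": "A2"}]
feed_b = [{"title": "Shared"}, {"title": "B1"}]

# Assumption: results from each url are simply concatenated in order.
combined = feed_a + feed_b

# Order used in the QRY above: limit first, then dedup.
limit_then_unique = unique_by(limit(combined, 4), "title")

# Alternative order: dedup first, then limit.
unique_then_limit = limit(unique_by(combined, "title"), 4)

print(len(limit_then_unique), [a["title"] for a in limit_then_unique])
print(len(unique_then_limit), [a["title"] for a in unique_then_limit])
```

Under these assumptions, limiting first can return fewer than the "desired QTY" because duplicates are removed after the cut, while deduplicating first fills the quota when enough distinct titles exist. Whether YQL behaves this way for the query above still needs to be tested against the live feeds.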