#4 Allow for items to be deduplicated under a different name.

Open
arkiver opened 1 year ago · 0 comments
arkiver commented 1 year ago

This will allow us to put context information in the item name, while not including the context information in the deduplication process.

Using this, we can get rid of most of urls:filters, since those are mostly in place to prevent loops on bad page requisites. The context would hold the crawling depth, and we would not queue a new URL if the depth is over a certain threshold.

Example:
An item named `url=URL&context=CONTEXT` would be deduplicated under, for example, `URL`.
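
A minimal sketch of how this could work, not an actual implementation. It assumes item names are query-string encoded and that the context is just the integer crawl depth. The names `make_item`, `parse_item`, `dedup_key`, `ItemQueue` and `MAX_DEPTH` are hypothetical and only here for illustration.

```python
from urllib.parse import parse_qs, urlencode

MAX_DEPTH = 3  # hypothetical threshold; the real value is not decided here


def make_item(url, depth):
    """Build an item name of the form url=URL&context=CONTEXT.

    Assumption: the context is only the crawl depth, stored as an integer.
    """
    return urlencode({"url": url, "context": str(depth)})


def parse_item(item):
    """Split an item name back into (url, depth)."""
    fields = parse_qs(item, keep_blank_values=True)
    return fields["url"][0], int(fields["context"][0])


def dedup_key(item):
    """The name used for deduplication: the URL only, without the context."""
    return parse_item(item)[0]


class ItemQueue:
    """Toy in-memory queue that deduplicates on dedup_key(item)."""

    def __init__(self):
        self.seen = set()
        self.pending = []

    def offer(self, item):
        """Queue the item unless it is too deep or its URL was already seen."""
        url, depth = parse_item(item)
        if depth > MAX_DEPTH:
            return False  # depth over threshold, do not queue
        if url in self.seen:
            return False  # same URL already queued under another context
        self.seen.add(url)
        self.pending.append(item)
        return True
```

With this, the same URL discovered at depth 1 and again at depth 2 gives two different item names but one deduplication key, so only the first is queued. The full item name, including the context, is still what ends up in the queue.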

arkiver added the enhancement label 1 year ago