#4 Allow for items to be deduplicated under a different name.

오픈
arkiver1 년 전을 오픈 · 0개의 코멘트
arkiver 코멘트됨, 1 년 전

This will allow us to put context information in the item name, while not including the context information in the deduplication process.

Using this, we can get rid of most of urls:filters, since those are mostly in place to prevent loops on bad on bad page requisites. The context would hold the crawling depth, and we would not queue a new URL if depth is over a certain threshold.

Example:
url=URL&context=CONTEXT would be deduplicated under for example URL

This will allow us to put context information in the item name, while not including the context information in the deduplication process. Using this, we can get rid of most of urls:filters, since those are mostly in place to prevent loops on bad on bad page requisites. The context would hold the crawling depth, and we would not queue a new URL if depth is over a certain threshold. Example: `url=URL&context=CONTEXT` would be deduplicated under for example `URL`
arkiver added the
enhancement
label 1 년 전
로그인하여 이 대화에 참여
마일스톤 없음
담당자 없음
참여자 1명
알림
마감일

마감일이 설정되지 않았습니다.

의존성

이 이슈는 어떠한 의존성도 가지지 않습니다.

불러오는 중...
아직 콘텐츠가 없습니다.