#4 Allow for items to be deduplicated under a different name.

Otwarty
otworzone 1 rok temu przez arkiver · 0 komentarzy
arkiver skomentował(-a) 1 rok temu

This will allow us to put context information in the item name, while not including the context information in the deduplication process.

Using this, we can get rid of most of urls:filters, since those are mostly in place to prevent loops on bad on bad page requisites. The context would hold the crawling depth, and we would not queue a new URL if depth is over a certain threshold.

Example:
url=URL&context=CONTEXT would be deduplicated under for example URL

This will allow us to put context information in the item name, while not including the context information in the deduplication process. Using this, we can get rid of most of urls:filters, since those are mostly in place to prevent loops on bad on bad page requisites. The context would hold the crawling depth, and we would not queue a new URL if depth is over a certain threshold. Example: `url=URL&context=CONTEXT` would be deduplicated under for example `URL`
arkiver dodał(-ęła) etykietę
enhancement
1 rok temu
Zaloguj się, aby dołączyć do tej rozmowy.
Brak kamienia milowego
Brak przypisanych
Uczestnicy 1
Powiadomienia
Termin realizacji

Brak ustawionego terminu realizacji.

Zależności

To zgłoszenie nie ma w tej chwili żadnych zależności.

Ładowanie…
Nie ma jeszcze treści.