user-inactivated  ·  3262 days ago  ·  link  ·    ·  parent  ·  post: The Mixed-Up Brothers of Bogotá

    mk we're officially adding duplicate link checking to the list!

Isn't it already there? I swear I've gotten warnings that a link had been posted before.

But you didn't both post the same url; yours has a query string.





insomniasexx  ·  3262 days ago  ·  link  ·  

Yeah the link checking is very limited. There should be a way to check against full URLs or URLs minus the query strings, right?

user-inactivated  ·  3262 days ago  ·  link  ·  

Sometimes the query string is significant though. https://hubski.com/pub?id=242018 is not https://hubski.com/pub?id=242175 . I guess you could try fetching the url with and without the query string, and assume it's safe to compare without the query string if they both return 200s and mostly the same contents?
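A rough sketch of that fetch-and-compare idea, not anything Hubski actually runs. The `fetch` callable, the 0.9 similarity threshold, and the function names are all assumptions made up for illustration; `fetch` is expected to return a `(status_code, body_text)` pair so the HTTP client can be swapped in or faked.

```python
from difflib import SequenceMatcher
from urllib.parse import urlsplit, urlunsplit


def strip_query(url):
    """Return the URL with its query string removed."""
    parts = urlsplit(url)
    return urlunsplit((parts.scheme, parts.netloc, parts.path, "", parts.fragment))


def query_is_ignorable(url, fetch, threshold=0.9):
    """Guess whether the query string can be dropped when comparing URLs.

    `fetch` is a hypothetical callable: fetch(url) -> (status_code, body_text).
    `threshold` is an assumed cutoff for "mostly the same contents".
    """
    bare = strip_query(url)
    if bare == url:
        return True  # nothing to strip

    status_full, body_full = fetch(url)
    status_bare, body_bare = fetch(bare)

    # Both versions have to return 200, otherwise the query string matters.
    if status_full != 200 or status_bare != 200:
        return False

    # Compare the two bodies; a ratio of 1.0 means identical text.
    similarity = SequenceMatcher(None, body_full, body_bare).ratio()
    return similarity >= threshold
```

With the two Hubski links above, the page at `/pub` without an `id` would presumably differ from either post (or not return a 200 at all), so the check would report the query string as significant. The cost is two extra requests per submission.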

insomniasexx  ·  3262 days ago  ·  link  ·  

True.

Although I would set it up to be:

    Danger: This link has been submitted before...

And

    Warning: it is possible that this link has been submitted before...

The second is for matches that differ only in the query string. It shows the post it found and lets you submit anyway, if you so choose. The percentage of sites that use query strings to address content is very low, especially among blogs and news outlets, since SEO is a big deal to them and Google doesn't like query strings.

Hubski is just special like that.

For the time being, we could manually whitelist sites that are known to use query strings as URLs.
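Something like the following could cover both the two-level warning and the whitelist. This is only a sketch under assumptions: `QUERY_SIGNIFICANT_HOSTS`, `comparison_key`, and `check_duplicate` are hypothetical names, and the list of already-submitted URLs is passed in as a plain list rather than looked up from wherever Hubski stores posts.

```python
from urllib.parse import urlsplit, urlunsplit

# Hypothetical manual whitelist: hosts where the query string identifies the content.
QUERY_SIGNIFICANT_HOSTS = {"hubski.com"}


def comparison_key(url):
    """Drop the fragment, and the query string too unless the host is whitelisted."""
    parts = urlsplit(url)
    host = parts.hostname or ""
    query = parts.query if host in QUERY_SIGNIFICANT_HOSTS else ""
    return urlunsplit((parts.scheme, parts.netloc, parts.path, query, ""))


def check_duplicate(new_url, existing_urls):
    """Return (level, matching_url), where level is "danger", "warning", or None."""
    # Exact match: the link has definitely been submitted before.
    if new_url in existing_urls:
        return "danger", new_url

    # Match after stripping the query string: possibly submitted before.
    key = comparison_key(new_url)
    for old in existing_urls:
        if comparison_key(old) == key:
            return "warning", old

    return None, None


posted = ["https://example.com/story", "https://hubski.com/pub?id=242018"]
print(check_duplicate("https://example.com/story?ref=rss", posted))
print(check_duplicate("https://hubski.com/pub?id=242175", posted))
```

The host match here is exact, so `www.hubski.com` would need its own whitelist entry. Either level would still let the user go ahead and submit.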