Lots of duplicates

  • Posts: 82863
  • Thank you received: 13372
  • MODERATOR
1 year 2 months ago #354687

Hi,

Well, I now see the canonical URL tag in the header of your page:
i.imgur.com/hmfSlPF.png
That means that even if the URL brands.html?limitstart_category=0&limit=12&limit=12&limit=12&limit=12&limit=12&limit=12&limit=12&limit=12 is crawled by a search engine, it will only index the canonical URL and users searching for things on the search engine will only see the canonical URL.
So I think it's fine like that.

Please Log in or Create an account to join the conversation.

  • Posts: 220
  • Thank you received: 10
1 year 2 months ago #354836

Hello. The fact is that the crawler bypasses all the pages. What can affect the crawling budget of the site. When I scanned the site with a crawler, a lot of memory resources began to consume. Since such addresses were constantly generated at some point, and it felt like they never ended, at some point I stopped scanning the site, as the site was down.

Last edit: 1 year 2 months ago by ilyaminsk.

Please Log in or Create an account to join the conversation.

  • Posts: 220
  • Thank you received: 10
1 year 2 months ago #354837

For some reason, there is an endless generation of such urls

Please Log in or Create an account to join the conversation.

  • Posts: 82863
  • Thank you received: 13372
  • MODERATOR
1 year 2 months ago #354842

Hi,

Well, that means that the crawler has a problem.
It shouldn't hog all the resources of the server.
Search engines' crawlers don't do that.

Please Log in or Create an account to join the conversation.

  • Posts: 220
  • Thank you received: 10
1 year 2 months ago #354851

Hello. There is a limited amount of megabytes of memory on the server, and such urls consume a large amount, because if you bypass the site with any crawler, there is an endless generation of such urls.
Go around try the site with any crawler: Screaming Frog SEO Spider, or any other.

URLs like this shouldn't be generated

This message contains confidential information


It is not normal

Please Log in or Create an account to join the conversation.

  • Posts: 82863
  • Thank you received: 13372
  • MODERATOR
1 year 2 months ago #354855

Hi,

Finally, you provided some information on how you got that URL.
So I went and downloaded that tool called "Screaming Frog SEO Spider" and ran it on the URL /products/katalog/furnitura-dlya-dverej.html of your website.
I did get URLs with two "limit" parameters when it shouldn't happen, but not more as the tool is limited to 500 URLs in its free version.
What I learned is that these URLs with the extra limit parameter come from the rel="prev" and rel="next" meta tags of the products/katalog/furnitura-dlya-dverej.html?limitstart_category=12&limit=12 page :
i.imgur.com/VtLDKGW.png
However, as far as I know, HikaShop doesn't add these meta tags to the pages:
i.imgur.com/XRJqJF6.png
I'm thinking you're using a SEO plugin which does that.

I also just used alphacrawler.pro/ on your website and didn't see any problem regarding this issue. So maybe using another tool would help.

But maybe the thing adding the rel tags need to be fixed on your end.

In any case, whether the SEO crawler you're using is at fault or the SEO plugin adding the rel tags is at fault, it doesn't seem to be something we can do anything about on our end.

Please Log in or Create an account to join the conversation.

  • Posts: 220
  • Thank you received: 10
1 year 2 months ago #354858

Hello. There is a free scanner

This message contains confidential information

With it, you can check the entire site


Third party seo is not used. Here is everything that is used:

This message contains confidential information

It is not clear why then such urls are formed:
ilsanmarket.by/brands.html?limitstart_ca...=0&limit=12&limit=12
Maybe it has something to do with the template?

Attachments:

Please Log in or Create an account to join the conversation.

  • Posts: 220
  • Thank you received: 10
1 year 2 months ago #354859

It's really unusual going from page 1 to page 2 and so on, the length of the url gets longer each time

Attachments:

Please Log in or Create an account to join the conversation.

  • Posts: 220
  • Thank you received: 10
1 year 2 months ago #354861

Watch this video

This message contains confidential information

Please Log in or Create an account to join the conversation.

  • Posts: 220
  • Thank you received: 10
1 year 2 months ago #354862

"I also just used alphacrawler.pro/ on your website and didn't see any problem regarding this issue. So maybe using another tool would help." - You didn’t see it here because the site has a 100 url limit of more than 5000 urls

Please Log in or Create an account to join the conversation.

  • Posts: 82863
  • Thank you received: 13372
  • MODERATOR
1 year 2 months ago #354870

Hi,

Thanks. With that tool, I was finally able to properly reproduce the problem and I think I found what is going on.
Change the code:

		elseif($ret_start && !$ret_limit)
			return $current_url . '&limit='.$this->limit;
to:
elseif($ret_start && !$ret_limit) {
			if(strpos($current_url, '&limit='.$this->limit)) {
				return $current_url;
			}
			return $current_url . '&limit='.$this->limit;
		}
in the file administrator/components/com_hikashop/helpers/pagination.php and it should prevent this from happening.
Please let us know how it goes so that we can include the patch on our end for the next version of HikaShop.

Please Log in or Create an account to join the conversation.

  • Posts: 220
  • Thank you received: 10
1 year 2 months ago #354888

Hi. There is no longer an endless generation of url tails, which is certainly pleasing. I noticed another mistake, could you fix it so that 1 page was without tails like: ?limitstart_category=0&limit=12

Attachments:
Last edit: 1 year 2 months ago by ilyaminsk.

Please Log in or Create an account to join the conversation.

  • Posts: 220
  • Thank you received: 10
1 year 2 months ago #354900

When I scan another such url is visible:

This message contains confidential information


only 70 url, it's not critical anymore

Last edit: 1 year 2 months ago by ilyaminsk.

Please Log in or Create an account to join the conversation.

  • Posts: 82863
  • Thank you received: 13372
  • MODERATOR
1 year 2 months ago #354899

Hi,

could you fix it so that 1 page was without tails like: ?limitstart_category=0&limit=12

No. This is normal. It's not a mistake and it doesn't have any negative impact.

When I scan another such url is visible

I see what the problem is here. It's similar to the other issue but with a different scale.
Change the code :
return $current_url.$sep.'limitstart'.$this->hikaSuffix.'='.$start.'&limit='.$this->limit;
to:
$return_url = $current_url.$sep.'limitstart'.$this->hikaSuffix.'='.$start;
		if(strpos($return_url, '&limit='.$this->limit)) {
			return $return_url;
		}
		return $return_url.'&limit='.$this->limit;
in administrator/components/com_hikashop/helpers/pagination.php and it will avoid these.
We'll add the change on our end too.

Last edit: 1 year 2 months ago by nicolas.
The following user(s) said Thank You: ilyaminsk

Please Log in or Create an account to join the conversation.

  • Posts: 220
  • Thank you received: 10
1 year 2 months ago #354929

Hello, added, and what did this code affect? I noticed that pages 1 and 2 under the same url



In general, it’s not entirely correct when pointing such a tail to the first page.

I looked at many online stores, most of them I did not see this. Also in the joomla blog itself, when you drive to 1 page, it is displayed without tails

Attachments:

Please Log in or Create an account to join the conversation.

  • Posts: 82863
  • Thank you received: 13372
  • MODERATOR
1 year 2 months ago #354932

Hi,

The patch I provided is for the URL

/products/katalog/furnitura-dlya-dverej.html?limitstart_category=0&limit=12&limitstart=96&limit=12
you were talking about in www.hikashop.com/forum/install-update/90...plicates.html#354900
It doesn't affect the URL of the pagination page you're pointing out here.

Please Log in or Create an account to join the conversation.

  • Posts: 220
  • Thank you received: 10
1 year 2 months ago #354953

Is it possible to make 1 page without a tail?, as indicated in the screenshot here: #354929

Please Log in or Create an account to join the conversation.

  • Posts: 82863
  • Thank you received: 13372
  • MODERATOR
1 year 2 months ago #354954

Hi,

Almost anything is possible in programing if you spend enough time and money on a problem.

It's possible to change that. However, it's not easy. The code which modifies the current URL to generate the URLs of the pagination can potentially remove the limitstart parameter if it's 0, but removing the limit parameter when its 12 would be quite difficult as the pagination system doesn't know what is the default limit of the current listing.
That would probably require hours of work to make a first solution, and even so, it would probably have negative and unforseen side effects which will take again hours of work to fix.
On the positive side, even if we did that work, it wouldn't change anything since there is already a canonical URL on the page which discards the URL of the pagination when search engines crawl your website.

That's why I answered you :

No. This is normal. It's not a mistake and it doesn't have any negative impact.

Last edit: 1 year 2 months ago by nicolas.

Please Log in or Create an account to join the conversation.

  • Posts: 220
  • Thank you received: 10
1 year 2 months ago #355015

Hello. Thanks for the answer

Please Log in or Create an account to join the conversation.

Time to create page: 0.117 seconds
Powered by Kunena Forum