Huge Hikashop site / Incredibly slow

  • Posts: 90
  • Thank you received: 2
11 years 10 months ago #91061

Hello to the HikaShop team.

I have created a website using your component and uploaded 151.000 products.
Each product has 3 options, totaling a number or 18 variants per product.

That rounds up to a teoretical 3 million products.

The DB has reached 1.4 GB, even if the uploaded csv lists total around 300-500MB.

The images are about 30GB in size (total), there are 2 images per product.

I believe that my website could also benefit you as a case study - see what happens when HikaShop is taken to the limit.

I've encountered a number of problems on my way to building this shop.
The biggest problem was creating the csv lists. Because the product is on 1 row and the variants have to be under that row as consecutive rows it was incredibly difficult to create that in an excel file.

For example, my website deals in paintings. The options deal with 1) type of product Painting/Poster 2) Size. There are 6 different possible sizes for a painting but not all of the paintings can be printed at all of those sizes. But because I could not make the csv otherwise i had to insert all the size variants for all the paintings and unpublish the not-necessary ones. Probably instead of 3 million variants I would've had half of that.

Another problem is that even if I specified the order for each product in each category in the CSV file, that didn't apply - it still has the order that was auto assigned.

Another problem is with the keywords - how do you sepparate them? If i sepparate them using comma when I import the csv and then export it again - only the first keyword is exported.

But the biggest problem of the moment is the incredibly slow speed the site is working on now.

The website is hosted on a virtual ubuntu, with 4 cores (8 threads) available and 2GB of ram in an HP server that's empty. The server is an HP blade - Intel Xeon E5620mhz with scsi raid 5.

I have about 30-40 categories with total of 151.000 individual products.

If i click on a category that has around 2000-5000 products inside, it takes between 10-30seconds to display the first 36 products.
If i click on a category that has around 40.000 products inside, it takes over 1-2 minutes.

If i click on a category that i've clicked previously it loads up ok - which means the cache is working, but even in that category, if I click on the next page of products (same category) it takes again the same amount of time.

Also if I click on a product it takes an age to load up.

I don't want to publish here publicly the address of the hosting server so please ask of me and I'll give you complete controll of the server.

I really need your help on this - I've got no options left, i managed to get around other shortcomings but I'm stuck now.

Thank you,

Lex.

Please Log in or Create an account to join the conversation.

  • Posts: 2334
  • Thank you received: 403
11 years 10 months ago #91120

Hi Lex,

About the speed problem, could you activate the debug mode and send us the report at the end of the page. There should be the time taken by the SQL query we'll be able to see what's taking so. But it sounds quite strange that even loading a product page is slow. You can PM me a link to your website if you want so we'll take a look by ourselves.

About the ordering, it depends of the category. It's the "category_ordering" column and I just tried it seems it working on my end.
About the keyword you indeed have to use comma but also a space after the comma. Like "one, two". Same thing here I just tried and it works.

By the way, do you have the last version?

Please Log in or Create an account to join the conversation.

  • Posts: 90
  • Thank you received: 2
11 years 10 months ago #91164

Yes, it's the latest version. I gave you access by PM for everything you need.
By the way, on the main page of the website, scroll down and there's a menu on the left side with 40 different categories. The first ones will load fast - they are in the cache. Try the lower ones in the list - you'll see the problem.

Please Log in or Create an account to join the conversation.

  • Posts: 90
  • Thank you received: 2
11 years 10 months ago #91366

I forgot to mention a problem - it might interest you.

The first import file I used had around 300-400 thousand records. After about 10 minutes it crashed and i kept making the file smaller and smaller until i saw that importing 2000 products with 18 variants worked.

So 36.000 products + variants is probably close to the maximum number the import can handle. I had to break my csv files in about 100 files to be able to import.

In the beginning it took around 5 minutes for each file to import. After about the 10'th, 15'th import file it took 15-18 minutes per import. Because mysql uses (i believe) 1 query per 1 thread, i disabled hyperthreading in my BIOS so 1 query would use one full core of the processor. That reduced the time back to about 13 minutes when it reched 20minutes/file.

Than I had an idea - start importing 2 files at once - I opened up to 2 administrator pages and started importing one file each - but it took twice as long - so no luck there.

I thought long and hard about it, but because I imported previously just the products and now i was importing the variants I realised (or hope so) that the DB wouldn't be scrambled.


Another problem is that my pictures all end in domain-name.xx.jpg (where xx is my domain) in the hikashop backend all pictures end in .xx no .xx.jpg - which is, i think, a bit weird. But the pictures work - they show up.

Another problem is that if I specify a certain image for one variant, then the rest of the variants MUST have specified images, otherwise it will show the hikashop default no-image. I have 18 variants plus main product. But i only use 2 pictures. Product + 6 variants use 1 image, 12 variants use the second one. So i had to mention, practicaly 17 times the path to a picture which was already specified to other 2 variants. 17 paths times 151.000 products = 2.6 million paths specified. This little detail probably ocuppies a lot of space in the DB - I don't know if this could be made otherwise but it's worth considering.

Another problem - this could be architectuarly-wise - It probably would be a good idea to be able to specify the default price for variants somwehere. I only have about 50 different prices in total - but i have almost 3 million prices in the DB because I had to specify a price for each variant for each product. Even if it's the same variant for all the products. This problem applies also to the name of the variant.

The way it is made now it's very user friendly to do anything, the learning curve it's steep but as i proved i could import many many products there should also be easy and effective ways of importing many many products.

lex.

Please Log in or Create an account to join the conversation.

  • Posts: 83049
  • Thank you received: 13408
  • MODERATOR
11 years 10 months ago #91556

Hi,

Thank you for the accesses.
I took a few hours today to look at the performance issues on the products listings and the product page.
I was able to trace the queries and reduce tremendously their execution time by adding indexes on the columns where it was necessary.
I had also to split a left join query to two queries in the file components/com_hikashop/views/product/view.html.php however, we won't be able to add that by default in HikaShop as we loose the possibility to order the products listings by product ordering.

I used principally the Masini category for my tests since it's the biggest one. Before starting, I had loading times above 70 seconds. I now have loading times always below 5 seconds and sometimes below 1 second.
I doubt that we'll be able to go much lower. With tables of 3M entries, it's hard to do much better. There sure is the possibility to cache in PHP the result of some queries but the gains are negligible compared to the index gains that I worked on today on your website.

The import was meant to be used to import very complex CSVs with lot of processing and, because of that, it can only import a few tens of thousands products at a time. In most cases that's enough to import the entire catalog. But you indeed have quite an enormous catalog if you take into account all the variants.

The following user(s) said Thank You: lex131

Please Log in or Create an account to join the conversation.

  • Posts: 90
  • Thank you received: 2
11 years 10 months ago #91563

Hello Nicolas,

Thank you very very much for your time and for the success :).

A few things:

1) Can you tell me if the ordering is now done by the product_code ? that's what i'm seeing in the administrator side.

2) When i'll update hikashop, as new versions appear, and when i'll update from standard to business or other versions, will I lose any of the modifications you just did? Should I save myself some files, that you modified, sepparately?

3) Should I add indexes to all the tables everywhere? Will that do anything?

4) Also, I replied to a topic before and you required access to the site and I never gave it - www.hikashop.com/en/forum/3-bug-report/7...st-update.html#89649 - it's the issue with the variant name displayed after the product name in front-end even if I disabled it in the configuration. Can you please help me with that too?

Thank you!!!

Lex.

Please Log in or Create an account to join the conversation.

  • Posts: 90
  • Thank you received: 2
11 years 10 months ago #91564

5) also, under each picture this error appears on the product page:

Notice: Undefined property: hikashopImageHelper::$image in /var/www/lumiere/administrator/components/com_hikashop/helpers/image.php on line 234 Notice: Trying to get property of non-object in /var/www/lumiere/administrator/components/com_hikashop/helpers/image.php on line 234

I also have the site locally and the error doesn't appear - I don't know if it's something you did or it's because of the server - but the virtual ubuntu is simply a clone, no differences...

Please Log in or Create an account to join the conversation.

  • Posts: 83049
  • Thank you received: 13408
  • MODERATOR
11 years 10 months ago #91797

Hi,

1. The order is now done on the product_id field. You can change that in the hikashop options of the listing. Based on what you select, additional index in the product will be required. For example, if you want to sort on the product_code column, you will need to add an index on that column in the hikashop_product table via phpmyadmin. Beware that the creation of an index on a 3M entries table will take several minutes.

2. You won't loose the indexes in the database. The modification I did in the view.html.php file I mentioned in my previous post will be removed though. So you should back it up and reapply the changes after the update.

3. NO. The more you have indexes, the less the performance gain of using them is great. So if you add indexes everywhere, you will be back to square 1.
You should only add an index to a column if there is a query doing a where or a join or an order by on that column and that the query is slow. If you have several wheres, or several on in the join or several order by, you will prefer to have the index on all the columns mentioned in them. The order of the columns in the index can also be important in some cases.

4. What was displayed was the product code, not the product variant name. I've disabled the rpoduct code display and turned on the characteristic values append to the product name and it's working fine as far as I can see.

5. It's fixed. That's something which is actually already fixed on our end.

The following user(s) said Thank You: lex131

Please Log in or Create an account to join the conversation.

  • Posts: 90
  • Thank you received: 2
11 years 10 months ago #91807

Thank you, thank you, thank you!!!!!

It looks incredible now :)

Please Log in or Create an account to join the conversation.

Time to create page: 0.076 seconds
Powered by Kunena Forum