Hello,
I have an active license but I'd like to try asking for help here first before opening a ticket. Someone else maybe [hopefully] has had this same issue/experience (and this could help others). This is for LiteSpeed Web Server (not OpenLiteSpeed).
I recently converted from CentOS 7 to AlmaLinux 8.7 (I use WHM/cPanel).
On CentOS 7 this [crawler] script worked with no problems. So I'm wondering if there is something I am missing when it comes to configuration.
I'm using Magento [2.4.5-p1] but this topic is more towards the crawler script - I'm using the script linked on this page: Crawler Script | Magento 2 | LiteSpeed Documentation (litespeedtech.com). EDIT: I should also add that I have the crawler "enabled" as instructed HERE (the same as it was on CentOS 7).
As I mentioned above, it worked for the past two months with no issues, but now that I have upgraded this is what happens:
When I run the standard command (bash or sh - the error is the same):
The following error pops up in the terminal:
If I run this command (for individual URLs) it runs no problem:
If I remove the "-d" then it says "Sitemap connection success, but is not a valid xml" which is accurate because it's a URL - but that means it is sort of working but not completely.
Now you might be thinking "well the .xml sitemap file is missing" - I have checked it by going to the URL and the sitemap file loads just fine. I also re-ran the chown command on the crawler script file (I checked ls -l and it's the same owner config as it was on CentOS 7). Tried 777 for permissions on the crawler file and the .xml file. No such luck. I have searched on here and googled for hours, can't seem to find much on this. I'm thinking it's specific to LSWS and the crawler? There are no errors in the "error_log" file in public_html and there are no errors in the cPanel > Errors section.
Could anyone please point me in the right direction? I basically need the crawler because my store is very heavy and without it many pages load around 5-8 seconds which is not good for customers (once loaded the cache works as intended, but it's that first page load that hurts the worst).
Thank you in advance, I really appreciate your time.
I have an active license but I'd like to try asking for help here first before opening a ticket. Someone else maybe [hopefully] has had this same issue/experience (and this could help others). This is for LiteSpeed Web Server (not OpenLiteSpeed).
I recently converted from CentOS 7 to AlmaLinux 8.7 (I use WHM/cPanel).
On CentOS 7 this [crawler] script worked with no problems. So I'm wondering if there is something I am missing when it comes to configuration.
I'm using Magento [2.4.5-p1] but this topic is more towards the crawler script - I'm using the script linked on this page: Crawler Script | Magento 2 | LiteSpeed Documentation (litespeedtech.com). EDIT: I should also add that I have the crawler "enabled" as instructed HERE (the same as it was on CentOS 7).
As I mentioned above, it worked for the past two months with no issues, but now that I have upgraded this is what happens:
When I run the standard command (bash or sh - the error is the same):
Code:
sh M2-crawler.sh -c -r -v https://my-domain.com/sitemap.xml
or
sh M2-crawler.sh https://my-domain.com/sitemap.xml
Code:
Curl operation/command failed due to server return code - 404, exit
Code:
sh M2-crawler.sh -c -v -d https://my-domain.com/category
Now you might be thinking "well the .xml sitemap file is missing" - I have checked it by going to the URL and the sitemap file loads just fine. I also re-ran the chown command on the crawler script file (I checked ls -l and it's the same owner config as it was on CentOS 7). Tried 777 for permissions on the crawler file and the .xml file. No such luck. I have searched on here and googled for hours, can't seem to find much on this. I'm thinking it's specific to LSWS and the crawler? There are no errors in the "error_log" file in public_html and there are no errors in the cPanel > Errors section.
Could anyone please point me in the right direction? I basically need the crawler because my store is very heavy and without it many pages load around 5-8 seconds which is not good for customers (once loaded the cache works as intended, but it's that first page load that hurts the worst).
Thank you in advance, I really appreciate your time.
Last edited: