Skip to content
    Moz logo Menu open Menu close
    • Products
      • Moz Pro
      • Moz Pro Home
      • Moz Local
      • Moz Local Home
      • STAT
      • Moz API
      • Moz API Home
      • Compare SEO Products
      • Moz Data
    • Free SEO Tools
      • Domain Analysis
      • Keyword Explorer
      • Link Explorer
      • Competitive Research
      • MozBar
      • More Free SEO Tools
    • Learn SEO
      • Beginner's Guide to SEO
      • SEO Learning Center
      • Moz Academy
      • SEO Q&A
      • Webinars, Whitepapers, & Guides
    • Blog
    • Why Moz
      • Agency Solutions
      • Enterprise Solutions
      • Small Business Solutions
      • Case Studies
      • The Moz Story
      • New Releases
    • Log in
    • Log out
    • Products
      • Moz Pro

        Your all-in-one suite of SEO essentials.

      • Moz Local

        Raise your local SEO visibility with complete local SEO management.

      • STAT

        SERP tracking and analytics for enterprise SEO experts.

      • Moz API

        Power your SEO with our index of over 44 trillion links.

      • Compare SEO Products

        See which Moz SEO solution best meets your business needs.

      • Moz Data

        Power your SEO strategy & AI models with custom data solutions.

      Save big on Moz Pro!
      Limited time offer

      Save big on Moz Pro!

      See pricing
    • Free SEO Tools
      • Domain Analysis

        Get top competitive SEO metrics like DA, top pages and more.

      • Keyword Explorer

        Find traffic-driving keywords with our 1.25 billion+ keyword index.

      • Link Explorer

        Explore over 40 trillion links for powerful backlink data.

      • Competitive Research

        Uncover valuable insights on your organic search competitors.

      • MozBar

        See top SEO metrics for free as you browse the web.

      • More Free SEO Tools

        Explore all the free SEO tools Moz has to offer.

      Access the Moz API for less
      Save on Moz Data

      Access the Moz API for less

      Hurry - ends Dec 6th!
    • Learn SEO
      • Beginner's Guide to SEO

        The #1 most popular introduction to SEO, trusted by millions.

      • SEO Learning Center

        Broaden your knowledge with SEO resources for all skill levels.

      • On-Demand Webinars

        Learn modern SEO best practices from industry experts.

      • How-To Guides

        Step-by-step guides to search success from the authority on SEO.

      • Moz Academy

        Upskill and get certified with on-demand courses & certifications.

      • SEO Q&A

        Insights & discussions from an SEO community of 500,000+.

      Save 20% on all Moz Academy courses
      Limited time offer

      Save 20% on all Moz Academy courses

      Level up your SEO
    • Blog
    • Why Moz
      • Small Business Solutions

        Uncover insights to make smarter marketing decisions in less time.

      • Agency Solutions

        Earn & keep valuable clients with unparalleled data & insights.

      • Enterprise Solutions

        Gain a competitive edge in the ever-changing world of search.

      • The Moz Story

        Moz was the first & remains the most trusted SEO company.

      • Case Studies

        Explore how Moz drives ROI with a proven track record of success.

      • New Releases

        Get the scoop on the latest and greatest from Moz.

      Explore Moz updates & save up to 40%
      Big Savings!

      Explore Moz updates & save up to 40%

      See the latest
    • Log in
      • Moz Pro
      • Moz Local
      • Moz Local Dashboard
      • Moz API
      • Moz API Dashboard
      • Moz Academy
    • Avatar
      • Moz Home
      • Notifications
      • Account & Billing
      • Manage Users
      • Community Profile
      • My Q&A
      • My Videos
      • Log Out

    The Moz Q&A Forum

    • Forum
    • Questions
    • Users
    • Ask the Community

    Welcome to the Q&A Forum

    Browse the forum for helpful insights and fresh discussions about all things SEO.

    1. Home
    2. Moz Tools
    3. Getting Started
    4. Crawler was not able to access the robots.txt

    Unsolved Crawler was not able to access the robots.txt

    Getting Started
    robots.txt crawl error
    7
    9
    290
    Loading More Posts
    • Oldest to Newest
    • Newest to Oldest
    • Most Votes
    Reply
    • Reply as question
    Log in to reply
    This topic has been deleted. Only users with question management privileges can see it.
    • andrewrench
      andrewrench Subscriber last edited by

      I'm trying to setup a campaign for jessicamoraninteriors.com and I keep getting messages that Moz can't crawl the site because it can't access the robots.txt. Not sure why, other crawlers don't seem to have a problem and I can access the robots.txt file from my browser. For some additional info, it's a SquareSpace site and my DNS is handled through Cloudflare. Here's the contents of my robots.txt file:

      # Squarespace Robots Txt
      
      User-agent: GPTBot
      User-agent: ChatGPT-User
      User-agent: CCBot
      User-agent: anthropic-ai
      User-agent: Google-Extended
      User-agent: FacebookBot
      User-agent: Claude-Web
      User-agent: cohere-ai
      User-agent: PerplexityBot
      User-agent: Applebot-Extended
      User-agent: AdsBot-Google
      User-agent: AdsBot-Google-Mobile
      User-agent: AdsBot-Google-Mobile-Apps
      User-agent: *
      Disallow: /config
      Disallow: /search
      Disallow: /account$
      Disallow: /account/
      Disallow: /commerce/digital-download/
      Disallow: /api/
      Allow: /api/ui-extensions/
      Disallow: /static/
      Disallow:/*?author=*
      Disallow:/*&author=*
      Disallow:/*?tag=*
      Disallow:/*&tag=*
      Disallow:/*?month=*
      Disallow:/*&month=*
      Disallow:/*?view=*
      Disallow:/*&view=*
      Disallow:/*?format=json
      Disallow:/*&format=json
      Disallow:/*?format=page-context
      Disallow:/*&format=page-context
      Disallow:/*?format=main-content
      Disallow:/*&format=main-content
      Disallow:/*?format=json-pretty
      Disallow:/*&format=json-pretty
      Disallow:/*?format=ical
      Disallow:/*&format=ical
      Disallow:/*?reversePaginate=*
      Disallow:/*&reversePaginate=*
      

      Any ideas?

      a.grzegorczyk 1 Reply Last reply Reply Quote 0
      • Ahmadniaz78501
        Ahmadniaz78501 last edited by

        Hi there,

        If a crawler cannot access your robots.txt file, it might be due to server issues or incorrect file permissions. Ensure the robots.txt file exists on your server and check for any typos in the file name or path. Also, confirm that your server returns a 200 status code for the robots.txt file. If the server is incorrectly configured to return a 403 or 404 error, crawlers will be unable to access it.

        1 Reply Last reply Reply Quote 0
        • Ahmadniaz78501
          Ahmadniaz78501 last edited by

          Hi andrewrench,

          If a crawler cannot access your robots.txt file, it might be due to server issues or incorrect file permissions. Ensure the robots.txt file exists on your server and check for any typos in the file name or path. Also, confirm that your server returns a 200 status code for the robots.txt file. If the server is incorrectly configured to return a 403 or 404 error, crawlers will be unable to access it.

          1 Reply Last reply Reply Quote 0
          • Poojavm
            Poojavm last edited by

            When a crawler is unable to access the robots.txt file of a website, it typically means that the file is either missing, restricted, or inaccessible due to server issues. The robots.txt file provides directives to web crawlers about which parts of a website can or cannot be accessed and indexed. Here are some possible reasons and solutions:

            Possible Reasons:
            File Does Not Exist: The robots.txt file might not be present on the server.
            Permission Issues: The file could have restricted permissions that prevent it from being accessed by the crawler.
            Server Errors: Temporary server issues, such as a 403 Forbidden error, could block the crawler from accessing the file.
            Incorrect URL: The crawler might be trying to access the robots.txt file using the wrong URL or path.
            Blocked by Firewall: The server's firewall might be configured to block certain crawlers or user agents.
            Solutions:
            Create or Restore the robots.txt File: Ensure that the robots.txt file exists in the root directory of your website (e.g., https://www.example.com/robots.txt).
            Check File Permissions: Make sure the file has appropriate read permissions (typically 644).
            Review Server Logs: Check your server logs to identify any issues or errors related to the file's access.
            Verify URL: Ensure that the crawler is using the correct URL to access the file.
            Firewall Configuration: Review your firewall settings to allow access to the robots.txt file for all legitimate crawlers.
            Additional Steps:
            Test with Google Search Console: Use the "Robots.txt Tester" tool in Google Search Console to identify any issues.
            Check for Manual Blocking: Ensure that you haven't accidentally blocked access to the robots.txt file in your server's configuration or with specific rules in the file itself.
            By addressing these issues, you can ensure that crawlers can access your robots.txt file and follow the directives you've set for your website's content.

            link text

            1 Reply Last reply Reply Quote 0
            • Ahmadniaz78501
              Ahmadniaz78501 last edited by

              Hello,

              If a crawler cannot access your robots.txt file, this could create issues in how your site is indexed. Here are some steps to identify and address this problem:

              Check File Permissions:

              Make sure your robots.txt file is accessible. Set its permissions (typically 644) so it's readable by everyone; this can be accomplished using either your hosting control panel or FTP client.

              Verify File Location:

              Your robots.txt file should be located in the root directory of your website - for example if example.com was the domain, this would mean accessing it at example.com/robots.txt

              Make sure that your server is configured appropriately to serve the robots.txt file by reviewing its.htaccess or server settings to ensure there are no rules blocking access. Test with Google

              Search Console:

              Google Search Console makes it easy to test your robots.txt file using their "Robots.Txt Tester" under "Crawl." Simply visit this section of their platform, select your file, and see if Google can access it or if there are any errors with it.

              Review Content:

              Check that the content of your robots.txt file is accurate. Change content according to your requirements.

              Check for Syntax Errors:

              Even small syntax errors can have serious repercussions. Double-check for typos or formatting issues before publishing content to your site.

              1 Reply Last reply Reply Quote 0
              • alexcale
                alexcale last edited by

                If a crawler cannot access the robots.txt file, it may be due to server misconfigurations, incorrect file permissions, or the file being missing. The robots.txt file is essential for guiding web crawlers on which pages they are allowed to access.

                1 Reply Last reply Reply Quote 0
                • BlackcatSEOinc
                  BlackcatSEOinc last edited by

                  Did you check google.com/webmasters/tools/robots-testing-tool ? all good here ?

                  1 Reply Last reply Reply Quote 0
                  • ww4686101
                    ww4686101 last edited by

                    Hi Andrew,

                    It sounds like you're running into an issue with Moz being unable to access your robots.txt file, even though other crawlers and your browser can access it. Since your site is on SquareSpace and DNS is managed through Cloudflare, there could be a couple of things to consider:

                    Cloudflare Settings: Sometimes, Cloudflare's security settings (like firewall rules or bot management) can block certain bots, including Moz's crawler. You might want to check your Cloudflare settings to ensure that Moz’s IP addresses or user agents aren’t being inadvertently blocked.

                    Robots.txt Caching: Cloudflare may cache your robots.txt file. Try purging the cache for that file specifically to ensure the most up-to-date version is being served. This can sometimes resolve issues where different services see different versions of the file.

                    SquareSpace Configuration: Double-check if SquareSpace has any additional settings or restrictions that might affect how external crawlers, like Moz, interact with your site. Since SquareSpace handles a lot of things on the backend, their support might be able to provide more insight.

                    Allow Moz in robots.txt: If Moz’s specific user agent isn't listed in your robots.txt file, you could try explicitly allowing it by adding the following to the top of your file:

                    User-agent: rogerbot
                    Allow: /
                    
                    

                    If the issue persists after checking these, reaching out to Moz support with specifics about your setup might help you get more targeted assistance.

                    1 Reply Last reply Reply Quote 0
                    • a.grzegorczyk
                      a.grzegorczyk Staff @andrewrench last edited by

                      Hi @andrewrench!

                      Aneta from the help team here. I had a look at this for you and I can see that we are getting a 403 forbidden error when pinging your site. I would recommend looking into this and if you need any further help please don't hesitate to reach out to help@moz.com.
                      35ac3d17-de75-4592-bd18-0daa420fa55c-image.png

                      1 Reply Last reply Reply Quote 1
                      • 1 / 1
                      • First post
                        Last post

                      Got a burning SEO question?

                      Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.


                      Start my free trial


                      Browse Questions

                      Explore more categories

                      • Moz Tools

                        Chat with the community about the Moz tools.

                      • SEO Tactics

                        Discuss the SEO process with fellow marketers

                      • Community

                        Discuss industry events, jobs, and news!

                      • Digital Marketing

                        Chat about tactics outside of SEO

                      • Research & Trends

                        Dive into research and trends in the search industry.

                      • Support

                        Connect on product support and feature requests.

                      • See all categories

                      Related Questions

                      • blogwoman1

                        is my robot file correct

                        robots.txt ranking

                        hi, can anyone let me know if my robot file is correct. my pages and wordpress posts are being indexed but not showing in serps and wondering if my robot file is wrong https://www.in2town.co.uk/robots.txt

                        SEO Tactics | | blogwoman1
                        0
                      • Simon-Plan

                        Unsolved What would the exact text be for robots.txt to stop Moz crawling a subdomain?

                        robots.txt crawl disallow subdomain sub-domain

                        I need Moz to stop crawling a subdomain of my site, and am just checking what the exact text should be in the file to do this. I assume it would be: User-agent: Moz
                        Disallow: / But just checking so I can tell the agency who will apply it, to avoid paying for their time with the incorrect text! Many thanks.

                        Getting Started | | Simon-Plan
                        0
                      • ClaireU

                        Unsolved How do I cancel this crawl?

                        crawl error crawl in progress crawl stalled

                        The latest crawl on my site was the 4th Jan with a current crawl 'in progress'. How do i cancel this crawl and start a new one? I've been getting keyword ranking etc but no new issues are coming through. Screenshot 2022-05-31 083642.jpg

                        Moz Tools | | ClaireU
                        0
                      • TeamOneRep

                        Can I access old data/keyword research if I cancel my Moz Pro account?

                        I'm currently on the free month trial period for Moz Pro and I will probably cancel the account before the free period ends, but if I want to renew my subscription later, what happens to all the previous data? And does all the keyword research I've done disappear when I cancel it, or is it restored when I renew the subscription? Any insight is helpful! Thank you!

                        Getting Started | | TeamOneRep
                        0
                      • Avatardesk1

                        Crawler Accessibilit

                        In Insights section of MOZ campaign, I'm seeing this: https://imgur.com/Gu2K9dz Here are the contents of robots.txt: User-agent: *
                        Disallow: /wp-admin/ Sitemap: http://website.com.com/sitemap_index.xml Can you please let me know what is wrong here? Gu2K9dz

                        Getting Started | | Avatardesk1
                        1
                      • DGAU

                        Site with 2 domains - 1 domain SEO opimised & 1 is not. How best to handle crawlers?

                        Situation: I have a dual domain site:
                        Domain 1 - www.domain.com is SEO optimised with product pages and should of course be indexed.
                        Domain 2 - secure.domain.com is not SEO optimised and simply has checkout and payment gateway pages. I've discovered that Moz automatically crawls Domain 2 - the secure.domain.com site and consequently picks up hundreds of errors.
                        I have put an end to this by adding a robots.txt to stop rogerbot and dotbot (mozs crawlers) from crawling domain 2. This fixes my errors in Moz reports however after doing more research into 'Crawler Control' I figure this might be the best option. My Question: Instead of using robots.txt to stop moz from crawing all of Domain 2 should I use on each page of domain 2? I believe this would then allow moz and google to crawl Domain 2 but also tell them both not to index it.
                        My understanding is that this would be best, and might even help my overall SEO by telling google not to give any SEO value to the Domain 2 pages?

                        Getting Started | | DGAU
                        0
                      • tigersohelll

                        Our crawler was not able to access the robots.txt file on your site

                        Hello Mozzers! I've received an error message saying the site can't be crawled because Moz is unable to access the robots.txt. I've spoken to the webmaster and he can't understand why the robot.txt can't be accessed in Moz. https://www.thefurnshop.co.uk/robots.txt and Google isn't flagging anything up to us. Does anyone know how to solve this problem? Thanks

                        Getting Started | | tigersohelll
                        0
                      • Inframan

                        How Do I Scan My New Site & Grade My Work With The Robots Turned Off? For Pre-Inspection before I launch my Site?

                        I have a new site that has all the bots turned off so google can't index my site until I'm finished it. I've been working on this site for a couple months now optimizing and I was wondering if there was anyway I can run a preliminary scan on the site for my titles, URLs, Headers, Alt Tags and pretty much anything else that will grade my work and tell me if i did anything wrong? Can MOZ do this with the Bots turned off? Thanks

                        Getting Started | | Inframan
                        0
                      Moz logo
                      • Contact
                      • Community
                      • Free Trial
                      • Terms & Privacy
                      • Accessibility
                      • Jobs
                      • Help
                      • What's New
                      • News & Press
                      • MozCon
                      © 2021 - 2024 SEOMoz, Inc., a Ziff Davis company. All rights reserved. Moz is a registered trademark of SEOMoz, Inc.

                      Looks like your connection to Moz was lost, please wait while we try to reconnect.