What Are Soft 404s and How to Fix Them


What are Soft 404s and How to Fix Them

You’re likely already familiar with 404 errors that display a “page not found” message to visitors. While these pages serve a purpose, they can be harmful to your website if misused.

An example of a “page not found” error used in the wrong context is a soft 404 error. These can wreak havoc on your search engine rankings, website traffic, and even credibility.

Below, I’ll define soft 404 errors and cover their difference with regular (or hard) 404 errors. Then I’ll explain how to find these errors and resolve them.

What Are Soft 404 Errors?

A 404 error happens when a non-existent page (i.e., it previously existed but has since been removed) shows a “page not found” message when someone attempts to visit the URL.

When it’s a soft 404 error, the error is conveyed to website visitors but not to Google crawl bots. This means the pages continue to be crawled and indexed on Search Engine Results Pages (SERPs.)

A soft 404 is not an official status, rather, it is one Google creates to alert you. This means Google and other search engines continue to crawl these webpages until you take action.

What Is the Difference Between (Hard) 404 and Soft 404 Errors?

The difference between hard 404 errors and soft 404 errors is the status they display to search engines.

With a hard 404 error, the 404 error code is returned to both page visitors and search engines. This tells both that the page doesn’t exist. Here’s an example of a hard 404 error from the clothing site ModCloth:

modcloth 404 page for soft 404 errors

With a soft 404 error, the 404 error code is displayed to visitors but not search engines.

Instead, they’ll see a 200 OK status which gives them the green light to crawl.

Can Soft 404 Errors Impact My Site Ranking?

The short answer is yes, soft 404 errors can impact your website’s ranking on SERPs. To understand why you need to understand how Google crawls a website.

To prevent strain on a website’s server and reduce the workload for the crawl bots, Google uses something called Crawl Budget to determine how much and how often to crawl a site. The Crawl Budget is composed of Crawl Rate and Crawl Demand.

According to Google, the definitions are as follows:

  • Crawl Rate is the number of requests per second Googlebot makes to your site when it is crawling it.
  • Crawl Demand is how much Google wants to crawl your site based on popularity, how long it’s been since it crawled, etc.

When you have a large number of soft 404 errors on your website, you’re using a considerable amount of your crawl budget. This means less of your existing pages are crawled, leading to fewer and poorer rankings in SERPs.

Think of it this way: If your site has 100 pages, but Google determines your crawl budget is 70, then Google will only crawl 70 percent of your pages. If 10 pages are soft 404 errors, there’s a risk that some or all of those pages will fall into the crawl budget of 70. Thus, you lose out on the search engines crawling valuable pages.

What Problems Can Soft 404s Cause?

Aside from lower rankings in SERPs, other problems are associated with soft 404 errors on your website.

One of the biggest problems is a poor user experience. Since soft 404 URLs still appear in search results, users may be directed to non-existent pages. This reflects poorly on your website and can cause users to bounce.

There can also be implications for the website’s performance and operations.

While 404 error pages don’t take up as much server space as pages with content, they do still take up bandwidth. If Google and other search engines direct traffic to a non-existent page, you’re still taking hits, which can slow website speed and performance.

How to Find (and Fix) Soft 404 Errors

Since soft 404 errors aren’t an official status, they aren’t as obvious to website owners.

This is where a tool like Google Search Console comes in. According to Google, Search Console “help[s] you measure your site’s Search traffic and performance, fix issues, and make your site shine in Google Search results.”

One issue Search Console can help you to resolve is soft 404 errors.

1. Find Soft 404 Errors in Google Search Console

To use Google Search Console, sign up for an account and connect your domain. It’s a simple process that takes only a few minutes to complete.

Once connected and Search Console has had time to collect information, you’ll find soft 404 errors in Google Search Console under Coverage:

How to Find (and Fix) Soft 404 Errors - Google Search Console

It can show in one of two ways: as an error or as excluded (as shown above).

The soft 404s that appear with an error status in Google Search Console are what you want to target first. Those with an excluded status should still be addressed, but Google has stopped crawling those pages for one reason or another.

2. If the Page Doesn’t Exist: Set Up a Proper 404 or 410 Error

If the soft 404 error you received is real, set up a proper 404 (not found) or 410 (content deleted) error on the server. This ensures website visitors and crawl bots see the correct error.

Is there a difference between the 404 and 410 error codes when it comes to search engines?

According to Google’s John Mueller, not really. Both of these error codes tell Google to stop indexing the URL, which is the outcome you want in either case.

Most Content Management Systems (CMSs) have built-in 404 and 410 error pages. To make the most out of your 404 pages, I recommend adding a few related blog posts or providing a search feature or tool.

On my site, for example, users can analyze their website SEO from my 404 page:

How to Find (and Fix) Soft 404 Errors - Set Up a Proper 404

If your CMS doesn’t offer one, you’ll need to configure your server and design the page yourself or with the help of a website developer.

While a personalized 404 error page can take time and money, it’s also a great branding opportunity.

3. If the Page Does Exist: Improve the Content and Reindex

Google is good at finding soft 404 errors, but they’re not perfect. They may incorrectly report a page as a soft 404, even if content exists on the page.

This generally occurs when the content is short or irrelevant.

Assuming the page is necessary, you’ll want to bulk up the content to ensure it’s not mistaken for a soft 404. That doesn’t mean using fluff or filler, but relevant and engaging content that improves the quality of your website.

If your webpage needs bulking, consider adding images, charts and diagrams, and data-driven content you can back up with outside references. These lend credibility to your website, which is especially important if the page was returning a soft 404 error.

4. If the Page Was Moved: Set Up a 301 Redirect

When combining content pages or moving content around, it’s important to set up a redirect as soon as possible. Otherwise, your website visitors may be directed to a 404 page.

If this results in soft 404 errors, then you’ll be happy to know the fix is simple. You just need to set up a permanent 301 redirect to the newer page on your website.

The majority of CMSs have a built-in redirect tool. Simply enter the old URL and the new one, and the traffic is redirected automatically. This guide walks you through setting up redirects on WordPress.

Once set up, the redirect is communicated to search engine crawlers so they crawl the correct pages.

Soft 404 Errors: Frequently Asked Questions

What are soft 404 errors?

Soft 404 errors are “page not found” errors that weren’t properly communicated to search engine crawl bots. The error shows correctly to website visitors but not to crawl bots, which means they continue to crawl those pages, wasting time and resources.

Will fixing soft 404 errors increase my website traffic?

When soft 404 errors are resolved, the crawl bots can focus on a larger percentage of your active content pages. As a result, more content appears in SERPs. In theory, this can increase website traffic.

What is the difference between regular 404 and soft 404 errors?

The difference between a regular 404 and a soft 404 error is what is communicated to search engine crawl bots. With a regular 404, the website server delivers the 404 error to the crawl bot. With a soft 404, the website server delivers a 200 OK status, which means the bots continue to crawl the page.

Will Google penalize me for soft 404 errors?

There’s no official penalty for soft 404 errors, though there are natural consequences if the errors aren’t resolved. These include lower visibility on search engine results pages and a poor user experience.

Conclusion: Soft 404 Errors

When a page is removed from your website, there are proper ways to signal this removal to both website visitors and search engine crawl bots.

The most common method is setting up a 404 error page. If not configured properly, though, the 404 error may not be communicated to crawl bots. Thus, a soft 404 error occurs.

While a soft 404 error can cause issues with search engine rankings as well as user experience, they are easy to find and resolve.

Depending on the circumstances, you might set up a permanent 404, set up a 301 redirect, or improve the page content and reindex it. These fixes take just a few minutes but can significantly impact website performance.

When was the last time you checked for soft 404 errors?





Source link

Leave a Reply