SEO Insights – March 2022 Report

Justin AldridgeApril 1, 2022

March 2022 – SEO Insights image

Duplicate Content

60% of the content on the internet is duplicate, according to Gary Illyes, webmaster trends analyst at Google. That’s quite an astonishing number and reflects the challenge that search engines face when trying to make sense of the billions of pages that they discover when crawling the web and trying to deliver the most relevant search results to users.

Duplicate content can take many forms and a significant amount of this 60% most likely comprises websites that have multiple versions able to be crawled by search engines (www and non-www, http and https, etc.), and pages that are duplicated through URL parameters such as sorting and view options.

March 2022 - URL Parameters

For example, these are all essentially the same page but all, if not handled correctly, can be crawled and indexed by Google:

http://website.com/page
http://website.com/page/
https://website.com/page/
http://www.website.com/page/
https://www.website.com/page/
http://website.com/page/?sort=asc
https://website.com/page/?sort=asc
http://www.website.com/page/?sort=asc
https://www.website.com/page/?sort=asc
http://website.com/page?sort=asc
https://website.com/page?sort=asc
http://www.website.com/page?sort=asc
https://www.website.com/page?sort=asc
https://www.website.com/page?sort=asc&view=grid

Can you spot all of the differences in the URL formats?

Then we have duplicate content that has been syndicated across multiple websites, publication of press releases, products with descriptions that are used by all suppliers, etc.

And finally, and the area which is a little more grey, is content that is very similar.

Sometimes there are only so many ways in which a product, service or answer to a question can be phrased. This often causes pages to appear to be very similar and they can be flagged as duplicate content too.

When so much of the internet is seen as duplicate, it shows how difficult it is to be heard through all of the noise. But it highlights just how critical it is that content needs to be viewed by search engines as unique, engaging and useful for it to then be considered for indexing and ranking.

Content for content’s sake just doesn’t work for search these days. It requires more effort than ever before to give it the best possible chance to appear in the search results. ‘Content is king’ is becoming more and more relevant by the day.

Instability of Search Results

Here at Artemis we have a rank tracker that our clients can access to see how the rankings for their key search terms are performing over time. The rank tracker is helpful as a guide to evaluate overall progress but often it can lead to some concerns from clients when the tracker turns red instead of green.

March 2022 - Rankings

This is quite normal search ranking behaviour! As Heraclitus, the Greek philosopher once quoted:

“Change is the only constant in life”

This is very true of search engines. In 2020, Google made 4,500 changes to its search results; that’s 12 per day. The majority of changes will have been relatively minor, such as spacing of elements in the results pages, changes in colours, etc., whilst others, such as core updates, will have been quite significant.

In addition to this, Google has several AI algorithms working to further refine the search results, such as RankBrain, neural matching, Bert and very shortly, MUM. AI becomes exponentially more intelligent the more it learns, and so over time we can expect changes in search to appear faster and faster by the day.

We are already seeing this behaviour and it’s why stable search results just don’t exist these days. It’s very rare that the top 10 results don’t change at all. In fact, just searching from a different location, different device, different time of the day or time of the year, the results can change. And if something hits the news, everything changes!

Google’s ranking algorithm wouldn’t and doesn’t work if its results don’t constantly change and evolve. We can, and have to, accept that there will always be changes from month to month in the rankings of keywords, sometimes even on a daily or weekly basis.

But the red days are not a time to panic or get demoralised. It’s quite normal search behaviour. The important thing is to keep working on improving the content, speed, usability and refinement of the pages and adapting them to how Google’s perceived intent is changing over time for each search query.

Google URL Parameters Tool

Continuing with the change and duplicate content themes, Google announced in March that on April 26th they will be removing the URL parameter tool from Search Console.

March 2022 - GSC URL Parameters

This tool was introduced many years ago to help webmasters handle how Google crawls and indexes pages with URL parameters, for example, parameters which don’t actually make any difference to the actual content of the page, such as those used for sorting results.

The examples above show URLs with an ascending parameter included, for example:

https://www.website.com/page/?sort=asc

But, you can also often make a page display its content, such as products, in a descending format, for example:

https://www.website.com/page/?sort=dsc

The pages are the same, just displayed in a different way for the user. The URL parameter tool was introduced so that you could tell Google to ignore the “sort” parameter as that doesn’t change the content of the page. It was helpful to improve crawling and always having the correct page indexed, and only that page indexed, and not all of the variants.

However, Google has become very clever now at knowing how to handle URL parameters when the website hasn’t explicitly stated how to handle these through no-indexing or blocking crawlers in robots.txt.

It was never a very used tool and webmasters, SEOs and many content management systems are now much better at telling Google what to crawl and what not to crawl on a website.

Farewell Universal Analytics, hello Google Analytics 4 (GA4)

If you’ve logged into your Google Analytics (GA) account recently you may have spotted this new message:

March 2022 - GA3 message

The trusted, faithful and well-used Google Analytics that we have all become so reliant on for so many years is moving on and making way for an all-new version of analytics called GA4.

The announcement by Google in March that GA would stop processing data from July 2023 has had many SEOs in tears. GA4 is currently quite an unloved new product from Google, mainly because it’s so different to what we have been so familiar with for so long.

March 2022 - GA4

However, when you start spending time working with GA4, learning how it works and how to generate the reports and data that you need, it’s actually a far superior product to GA. It is also far quicker than GA (a much appreciated improvement) and uses AI extensively to help users by surfacing useful insights based on the data collected.

GA4 comes at a time where there is an increased shift to a cookie-less online world. It has been designed to be able to still collect or interpret data even when a user has chosen to not accept cookies on a website. Google originally stated the following about this:

“Because the technology landscape continues to evolve, the new Analytics is designed to adapt to a future with or without cookies or identifiers. It uses a flexible approach to measurement, and in the future, will include modelling to fill in the gaps where the data may be incomplete.”

Essentially, GA4 uses AI to fill in the gaps when there is missing data. So all is not lost when users are on your website but their cookies are disabled. With the current Google Analytics that data is never gathered and lost forever.

When you compare GA and GA4 data today you’ll notice some slight differences in the numbers in the reports. That’s because GA4 is capturing the data in a different way to GA and so these differences are a consequence of that.

We have already been preparing all of our clients for the changeover to GA4. We set up the GA4 accounts as soon as it was released which means that they’ve been accumulating data all this time. There is no backward compatibility of data with GA so it’s important to have this data now in GA4 for comparison reasons going forward.

Additionally, we will be providing some guides for our clients to become familiar with GA4 in the run up to the switch over. There’s still plenty of time before this happens but it’s good to be prepared.

We look forward to extracting the most of the new features and data available within GA4 to continue to benefit our clients in search.

Cookie	Duration	Description
apbct_cookies_test	session	CleanTalk sets this cookie to prevent spam on comments and forms and act as a complete anti-spam solution and firewall for the site.
apbct_page_hits	session	CleanTalk sets this cookie to prevent spam on comments and forms and act as a complete anti-spam solution and firewall for the site.
apbct_prev_referer	session	Functional cookie placed by CleanTalk Spam Protect to store referring IDs and prevent unauthorized spam from being sent from the website.
apbct_site_landing_ts	session	CleanTalk sets this cookie to prevent spam on comments and forms and act as a complete anti-spam solution and firewall for the site.
apbct_site_referer	3 days	This cookie is placed by CleanTalk Spam Protect to prevent spam and to store the referrer page address which led the user to the website.
apbct_timestamp	session	CleanTalk sets this cookie to prevent spam on comments and forms and act as a complete anti-spam solution and firewall for the site.
apbct_urls	3 days	This cookie is placed by CleanTalk Spam Protect to prevent spam and to store the addresses (urls) visited on the website.
apbct_visible_fields	never	CleanTalk sets this cookie to prevent spam on the site's comments/forms, and to act as a complete anti-spam solution and firewall for the site.
cookielawinfo-checkbox-advertisement	1 year	Set by the GDPR Cookie Consent plugin, this cookie is used to record the user consent for the cookies in the "Advertisement" category .
cookielawinfo-checkbox-analytics	1 year	Set by the GDPR Cookie Consent plugin, this cookie is used to record the user consent for the cookies in the "Analytics" category .
cookielawinfo-checkbox-functional	1 year	The cookie is set by the GDPR Cookie Consent plugin to record the user consent for the cookies in the category "Functional".
cookielawinfo-checkbox-necessary	1 year	Set by the GDPR Cookie Consent plugin, this cookie is used to record the user consent for the cookies in the "Necessary" category .
cookielawinfo-checkbox-others	1 year	Set by the GDPR Cookie Consent plugin, this cookie is used to store the user consent for cookies in the category "Others".
cookielawinfo-checkbox-performance	1 year	Set by the GDPR Cookie Consent plugin, this cookie is used to store the user consent for cookies in the category "Performance".
CookieLawInfoConsent	1 year	Records the default button state of the corresponding category & the status of CCPA. It works only in coordination with the primary cookie.
ct_checkjs	session	CleanTalk–Used to prevent spam on our comments and forms and acts as a complete anti-spam solution and firewall for this site.
ct_fkp_timestamp	session	CleanTalk sets this cookie to prevent spam on the site's comments/forms, and to act as a complete anti-spam solution and firewall for the site.
ct_pointer_data	session	CleanTalk sets this cookie to prevent spam on the site's comments/forms, and to act as a complete anti-spam solution and firewall for the site.
ct_ps_timestamp	session	CleanTalk sets this cookie to prevent spam on the site's comments/forms, and to act as a complete anti-spam solution and firewall for the site.
ct_sfw_pass_key	1 month	CleanTalk sets this cookie to prevent spam on comments and forms and act as a complete anti-spam solution and firewall for the site.
ct_timezone	session	CleanTalk–Used to prevent spam on our comments and forms and acts as a complete anti-spam solution and firewall for this site.
viewed_cookie_policy	1 year	The cookie is set by the GDPR Cookie Consent plugin to store whether or not the user has consented to the use of cookies. It does not store any personal data.

Cookie	Duration	Description
_hjAbsoluteSessionInProgress	1 hour	Hotjar sets this cookie to detect a user's first pageview session, which is a True/False flag set by the cookie.
bcookie	1 year	LinkedIn sets this cookie from LinkedIn share buttons and ad tags to recognize browser ID.
bscookie	1 year	LinkedIn sets this cookie to store performed actions on the website.
lang	session	LinkedIn sets this cookie to remember a user's language setting.
lidc	1 day	LinkedIn sets the lidc cookie to facilitate data center selection.
UserMatchHistory	1 month	LinkedIn sets this cookie for LinkedIn Ads ID syncing.

Cookie	Duration	Description
_ce.gtld	session	Crazyegg sets this cookie to identify the top-level domain.
_fbp	3 months	Facebook sets this cookie to display advertisements when either on Facebook or on a digital platform powered by Facebook advertising after visiting the website.
_ga	2 years	The _ga cookie, installed by Google Analytics, calculates visitor, session and campaign data and also keeps track of site usage for the site's analytics report. The cookie stores information anonymously and assigns a randomly generated number to recognize unique visitors.
_ga_*	1 year 1 month 4 days	Google Analytics sets this cookie to store and count page views.
_ga_VNVM83ZCVF	2 years	This cookie is installed by Google Analytics.
_gat_UA-*	1 minute	Google Analytics sets this cookie for user behaviour tracking.
_gat_UA-81041941-1	1 minute	A variation of the _gat cookie set by Google Analytics and Google Tag Manager to allow website owners to track visitor behaviour and measure site performance. The pattern element in the name contains the unique identity number of the account or website it relates to.
_gid	1 day	Installed by Google Analytics, _gid cookie stores information on how visitors use a website, while also creating an analytics report of the website's performance. Some of the data that are collected include the number of visitors, their source, and the pages they visit anonymously.
_hjFirstSeen	1 hour	Hotjar sets this cookie to identify a new user’s first session. It stores the true/false value, indicating whether it was the first time Hotjar saw this user.
_hjRecordingEnabled	session	Hotjar sets this cookie when a Recording starts and is read when the recording module is initialized, to see if the user is already in a recording in a particular session.
_hjSession_*	1 hour	Hotjar sets this cookie to ensure data from subsequent visits to the same site is attributed to the same user ID, which persists in the Hotjar User ID, which is unique to that site.
_hjSessionUser_*	1 year	Hotjar sets this cookie to ensure data from subsequent visits to the same site is attributed to the same user ID, which persists in the Hotjar User ID, which is unique to that site.
_vwo_uuid_v2	1 year	This cookie is set by Visual Website Optimiser and calculates unique traffic on a website.
cebs	session	Crazyegg sets this cookie to trace the current user session internally.
CONSENT	2 years	YouTube sets this cookie via embedded youtube-videos and registers anonymous statistical data.
UID	2 years	Scorecard Research sets this cookie for browser behaviour research.

Cookie	Duration	Description
cto_bundle	1 year 24 days	Criterio sets this cookie to provide functions across pages.
i	1 year	This cookie is set by OpenX to record anonymized user data, such as IP address, geographical location, websites visited, ads clicked by the user etc., for relevant advertising.
IDE	1 year 24 days	Google DoubleClick IDE cookies are used to store information about how the user uses the website to present them with relevant ads and according to the user profile.
li_sugr	3 months	LinkedIn sets this cookie to collect user behaviour data to optimise the website and make advertisements on the website more relevant.
NID	6 months	NID cookie, set by Google, is used for advertising purposes; to limit the number of times the user sees an ad, to mute unwanted ads, and to measure the effectiveness of ads.
test_cookie	15 minutes	The test_cookie is set by doubleclick.net and is used to determine if the user's browser supports cookies.
VISITOR_INFO1_LIVE	5 months 27 days	A cookie set by YouTube to measure bandwidth that determines whether the user gets the new or old player interface.
YSC	session	YSC cookie is set by Youtube and is used to track the views of embedded videos on Youtube pages.
yt-remote-connected-devices	never	YouTube sets this cookie to store the video preferences of the user using embedded YouTube video.
yt-remote-device-id	never	YouTube sets this cookie to store the video preferences of the user using embedded YouTube video.
yt.innertube::nextId	never	This cookie, set by YouTube, registers a unique ID to store data on what videos from YouTube the user has seen.
yt.innertube::requests	never	This cookie, set by YouTube, registers a unique ID to store data on what videos from YouTube the user has seen.

Cookie	Duration	Description
_ce.cch	session	Description is currently not available.
_ce.clock_data	1 day	Description is currently not available.
_ce.clock_event	1 day	Description is currently not available.
_ce.irv	session	Description is currently not available.
_ce.s	1 year	Description is currently not available.
_hjIncludedInSessionSample_432627	1 hour	Description is currently not available.
AnalyticsSyncHistory	1 month	No description
apbct_headless	session	No description
apbct_pixel_url	session	No description
cebsp_	session	Description is currently not available.
ct_checked_emails	session	No description
ct_has_scrolled	session	No description
ct_screen_info	session	No description
DEVICE_INFO	5 months 27 days	No description
li_gc	5 months 27 days	No description
ln_or	1 day	No description
loglevel	never	No description available.
optout	past	No description available.
VISITOR_PRIVACY_METADATA	6 months	Description is currently not available.
visitor-id	1 year	No description available.