Jekyll Robots Configuration

Sometimes we want to tell search engines to index our pages, but sometimes we don’t. Search Engine Optimization helps you make your pages appear at better location, while robots meta tag can prevent search engine to crawl certain pages. For my blog, I don’t want search engine to index the tag pages.

Ultimately, I want the following html meta tag appear in all the tag pages.

<meta name="robots" content="noindex" />

To achieve this result, we need to modify the front matter of tag pages, and add a few lines to the format of head section of pages.

Front Matter

For all tag pages, let’s add a new property robots to its front matter:

---
layout: tagpage
title: "Tag: hololens"
tag: hololens
robots: noindex
---

Then, for all tag pages, it has an additional property accessed via page.robots.

If you are curious about how to create tag pages, please visit this post.

Head Element

In the layout of the head section, usually at _includes/head.html, the following lines should be added to correctly add robots meta tag.

{% if page.robots %}
  <meta name="robots" content="{{page.robots}}" />
{% endif %}

Then, jekyll serve!

Thanks for reading!


Jekyll Tags on Github Pages

If you want to add tags to your Jekyll blog but found out that it is simply not supported by Github pages. You are at the right place here.

It is true that Github pages do not support customized Ruby plugins for Jekyll site. It is also true that the great jekyll-tagging is not one of Github default jekyll plugins.

As a result, we need to implement Jekyll tagging without plugins, it sounds hard, but here is a step-by-step guide to achieve so.

If you want to see live demo, you are already in a sample site that is hosted by Github, and has tags! Check out the source code: Github.

1. Add tags to your posts

In each of your post file (Markdown, presumably), you need to add the tags to the specific post in the front matter section. For example, for this post, the front matter looks like this:

---
layout: post
title: Jekyll Tags on Github Pages
description: blablabla
tags: jekyll blog github-page
---

In the tags entry, each tag is separated by white space. It is recommended to use lowercases for tag names.

2. Collect tags of all posts

What we need is a list named site.tags for our site. In order to do so, we can add a script to collect these tags using Liquid Templating of Jekyll.

In your _includes folder, add an html file called collecttags.html:

{% assign rawtags = "" %}
{% for post in site.posts %}
  {% assign ttags = post.tags | join:'|' | append:'|' %}
  {% assign rawtags = rawtags | append:ttags %}
{% endfor %}
{% assign rawtags = rawtags | split:'|' | sort %}

{% assign site.tags = "" %}
{% for tag in rawtags %}
  {% if tag != "" %}
    {% if tags == "" %}
      {% assign tags = tag | split:'|' %}
    {% endif %}
    {% unless tags contains tag %}
      {% assign tags = tags | join:'|' | append:'|' | append:tag | split:'|' %}
    {% endunless %}
  {% endif %}
{% endfor %}

This liquid scripts creates the list site.tags.

3. Execute the collection

You need to include the collecttags.html somewhere so that it is executed before site.tags are used.

My solution is to include it in head.html where Jekyll blogs defines the header. Google analytics script is also included here.

So, insert the following lines to _includes/head.html, inside head html section:

{% if site.tags != "" %}
  {% include collecttags.html %}
{% endif %}

This script runs the collecttags.html, and runs only once.

4. Display the tags of a post

We would like to display the tags of a post inside the post page, like this page. Then the modifications go to the format of posts: _layouts/post.

For example, the following code is inserted into the post layout:

<span>[
  {% for tag in page.tags %}
    {% capture tag_name %}{{ tag }}{% endcapture %}
    <a href="/tag/{{ tag_name }}"><code class="highligher-rouge"><nobr>{{ tag_name }}</nobr></code>&nbsp;</a>
  {% endfor %}
]</span>

I removed part of the style for this code snippets. You should adapt it to your own website design.

Note that a link is inserted to each of the tags of the post, at /tag/tag_name. We will come to that in the next step.

5. Generate the tag page

If you click on the tag displayed in this page, it will navigate you to a tag page, like this: hololens <- click it.

5.1. Define the layout of tag page

The tag page needs to be generated for all tags, because no plugin can be used to generate it when Github is building the page. Therefore, we treat the tag page as a kind of normal Jekyll page, with the only difference that it uses another layout.

OK. Now let’s create the layout of tag pages. Add another html file at _layouts/tagpage.html:

---
layout: default
---
<div class="post">
<h1>Tag: {{ page.tag }}</h1>
<ul>
{% for post in site.tags[page.tag] %}
  <li><a href="{{ post.url }}">{{ post.title }}</a> ({{ post.date | date_to_string }})<br>
    {{ post.description }}
  </li>
{% endfor %}
</ul>
</div>
<hr>

The above is the format of all tag pages. It displays all the posts that has the current tag, including post title, date and description.

5.2. Trigger the building of tag pages

In order to tell Github Jekyll engine to create a tag page, we need a markdown file specifying the specific tag to display and the above tagpage format.

Create a folder called tag in the root folder, and create a markdown file hololens.md like this:

---
layout: tagpage
title: "Tag: hololens"
tag: hololens
---

It triggers the building of a page at site_dir/tag/hololens/, with layout _layouts/tagpage.html, and parameter page.tag = hololens.

5.3. Automatic tag page creation (Optional)

If you would like to repeat step 5.2. for all the tags of your site, then you can skip this step.

Otherwise, I created a python script that crawls all the tags in the folder _posts/ and generate the desired tag page at tag/. It can be accessed here: tag generator.

You need to run this script before you push everything to Github repository and wait for your Github page.

6. Tag cloud display (Optional)

If you don’t want the tag cloud, you can skip this step.

Tag cloud can be like this: hololens <- click it (the bottom part). If you want such effect, you can also create the tag cloud without any plugin. You can change the size of the tag text depending on the number of posts related to it. Or you can display tags in the order of reference counts like my page.

What you need to do is to add another html page at _includes/archive.html:

<h2>Archive</h2>
{% capture temptags %}
  {% for tag in site.tags %}
    {{ tag[1].size | plus: 1000 }}#{{ tag[0] }}#{{ tag[1].size }}
  {% endfor %}
{% endcapture %}
{% assign sortedtemptags = temptags | split:' ' | sort | reverse %}
{% for temptag in sortedtemptags %}
  {% assign tagitems = temptag | split: '#' %}
  {% capture tagname %}{{ tagitems[1] }}{% endcapture %}
  <a href="/tag/{{ tagname }}"><code class="highligher-rouge"><nobr>{{ tagname }}</nobr></code></a>
{% endfor %}

This script fetches site.tags and sort them by the size of its referenced posts. The sorted list is in sortedtemptags. And it is iterated to be visualized. I am inspired by this stackoverflow answer.

You can add the following line to anywhere you want to display the tag cloud:

{% include archive.html %}

7. Push to Github

Done! Happy blogging!

If you like this post, please STAR my repository to motivate me!

Thanks for reading!