Creating RSS Feeds
- What are the requirement for the specified tracking page so that crawler could index my website?
- What social networks are supported by Mysitemapgenerator?
- How does automatic feed creation work?
- How Mysitemapgenerator crawls Markup and Structured Data?
- How is the formation of full-text RSS in the format for export to Google Publisher Center?
- How occurs RSS forming in the format of data export to Yandex News?
- How do I let visitors know that I have an RSS feed?
What are the requirement for the specified tracking page so that crawler could index my website?
Specified tracking page should be accessible, return HTML-code and contain links to the internal pages (if the search pattern is specified – corresponding links). Source page can also contain intrahost server redirect, which will be processed.
How does automatic feed creation work?
The feed is generated based on the content received from your site. The source can be Schema.org markup: Article, NewsArticle, BlogPosting, LiveBlogPosting and Open Graph Article, as well as standard HTML document tags.
When microdata is not detected on your site, Mysitemapgenerator tries to automatically determine the content of publications using its own web page extraction algorithm. Depending on the structure of your website, you can select the appropriate parameters in the generator settings. To achieve a better result, we recommend that you use Schema.org micro-markup on your site.
How Mysitemapgenerator crawls Markup and Structured Data?
Supported Schema.org markup types: Article, NewsArticle, BlogPosting, LiveBlogPosting:
- headline - Article title.
- description - Article short description.
- articleBody - Article text.
- image - The URL of an image resource .
- category - Article category.
- datePublished - Date.
Schema.org markup sample for news articles:
<div vocab="http://schema.org/" typeof="NewsArticle"> ... <h1 property="headline">Article title</h1> <span property="description">Article short description</span> <img itemprop="image" src="http://youwebsiteurl/pictures/thumbnail.jpg" /> <span itemprop="datePublished" content="1998-09-04T19:25:33+00:00">Today, 19:25</span> ... <div property="articleBody"> <p>Article full contents</p> ... </div> ... </div>
Schema.org markup sample for any article:
<div itemscope itemtype="http://schema.org/Article"> <h1 itemprop="headline">Заголовок публикации</h1> <div itemprop="description">Аннотация к публикации</div> <div itemprop="articleBody"> <p>Article full contents</p> ... </div> </div>
Open Graph (Supported type: Article):
- og:title — Article title.
- og:description — Article short description.
- og:image — The URL of an image resource.
- article:published_time - Date.
Open Graph markup example:
<meta property="og:type" content="article" /> <meta property="og:title" content="Article title" /> <meta property="og:description" content="Article short description" /> <meta property="og:image" content="http://youwebsiteurl/pictures/thumbnail.jpg" /> <meta property="article:published_time" content="1998-09-04T19:25:33+00:00" />
An example of pure HTML without microdata markup:
... <head> ... <title>Article title</title> <meta name="description" content="Article short description" /> ... </head> <body> ... <article> <p>Article full contents</p> ... </article> ... </body> ...
How is the formation of full-text RSS in the format for export to Google Publisher Center?
In addition to standard RSS 2.0 elements, the special element "content:encoded" is required to export data to Google Publisher Center, which contains the full contents of the publication:
<content:encoded>Full contents of the publication</content:encoded>
To include the full content of publications, the feed must use the additional namespace:
xmlns:content="http://purl.org/rss/1.0/modules/content/"
Our robot extracts the full text of publications on web pages using a special algorithm, as well as on pages marked with Schema.org markup (supported types: Article, NewsArticle, BlogPosting, LiveBlogPosting).
How occurs RSS forming in the format of data export to Yandex News?
Along with standard RSS 2.0 elements, in order to export data into Yandex a special elemnt "yandex-fulltext" is needed, which should contain full contents of the publication:
<yandex-fulltext>Full contents of the publication</yandex-fulltext>
To include the full content of publications, the feed must use the additional namespace:
xmlns:yandex="http://news.yandex.ru"
Full text should not contain the name of the source and date/time of the message, and contact information or any other information, which is repeated in every message.
How do I let visitors know that I have an RSS feed?
The easiest way is to create a link to your RSS feed and place it on a website. By clicking on it, visitors will be able to subscribe to updates through their RSS reader.
<a href="URL**">Subscribe via RSS</a>
Another way is to add a following meta-tag to the <head> section of the HTML-page:
<head> … <link rel="alternate" type="application/rss+xml" title="Feed Title*" href="URL**"> … </head>
* "Feed Title" - RSS feed title,
** "URL" - Destination feed URL.
If you have this tag, some browsers add a special icon, which is a link to your RSS feed, located in the right corner of the address bar.
What social networks are supported by Mysitemapgenerator?
Mysitemapgenerator supports Twitter, Youtube, VK and Reddit. Only public content is available for indexing.