Jump to content

Page metadata

From Meta, a Wikimedia project coordination wiki
The printable version is no longer supported and may have rendering errors. Please update your browser bookmarks and please use the default browser print function instead.

The following page and revision information is available in XML format in the XML/sql dumps. For the xml schema, see the appropriate version of the export xsd.

Page metadata

  • Title of the page
  • Namespace of the page
  • Page id (for old versions these are shown in the URLs of page history links)
  • If the page is a redirect, the title of the redirect

The entire page table dump is also available; see Database tables for more information about it.

Revision metadata

  • Id of the revision
  • If the edit was marked as a 'minor' revision by the editor
  • Date and time the edit was made
  • Username and user id, or IP address of the editor
  • Comment left by the editor when the edit was saved
  • Length in bytes of the revision content
  • Sha1 sum of the revision content
  • Revision id of the previous (parent) revision
  • Content model of the revision (is it wikitext? json?)
  • Content format of the revision

Additionally the id of the related entry in the text table is provided.

Content

In content dumps, almost all of the same metadata is provided, and the full content of included revisions is also written.

Not available in the XML files

Other metadata about a page is available in the aforementioned page table dump only, and includes:

  • If the page is protected
  • Whether the page is newly created or has more than one revision
  • Id of the most recent revision of the page
  • Length in bytes of the content
  • Content model of the page
  • Content language of the page

See also

See en:Wikipedia:Edit summary#Places where the edit summary appears for lists of edit metadata.

For details see the documentation of the MediaWiki database.

See also mw:Help:export, RDF metadata