Skip to main content

Fields

field_namedatatypedeprecated³accepted dataformatrequiredwritabledefaultExample
urlstringnourl¹yesyes-"url": "http://www.example.com/example.html"`
publishedlongnotimestamp in msyesyes-"published": 1392821902000
titlestringno<500 charsnoyes-"title": "Lorem ipsum dolor"
contentstringno<50,000 charsyesyes-"content": "Lorem ipsum dolor sit amet, consectetur adipiscing elit"
indexedlongno-nono-"indexed": 1392821902000
search_indexedlongno-nono-"search_indexed": 1392821902000
title_snippetstringno-nono-"title_snippet": "Lorem ipsum dolor"
content_snippetstringno-nono-"content_snippet": "Lorem ipsum dolor sit amet, consectetur adipiscing elit"
root_urlstringno-nonoextracted from url"root_url": "https://www.example.com/"
domain_urlstringno-nonoextracted from url"domain_url": "http://example.com/"
host_urlstringno-nonoextracted from url"host_url": "http://www.example.com/"
parent_urlstringno-noyes-"parent_url": "https://www.example.com/example.html"
langstringno2 char isonoyesdetected from content"lang": "de"
porn_levelintegerno0..100noyes-"porn_level": 25
fluency_levelintegeryes0..100noyes-"fluency_level": 42
spam_levelintegeryes0..100noyes-"spam_level": 31
noise_levelintegerno0..100noyes-"noise_level": 77
noise_categorystringnosee list²noyes-"noise_category": "promotions"
sentimentintegerno-5..5noyes0"sentiment": -2
reachintegerno>0noyes-"reach": 31415926
engagementintegerno>0noyes-"engagement": 271828
ratingintegerno0..10noyes-"rating": 7
fakenews_levelintegerno0..100noyes-"fakenews_level": 77
providerstringnoa-z0-9_ <100 charsnoyes-"provider": "company X"
source_typelist of stringnosee list²noyes"OTHER""source_type": "ONLINENEWS_NEWSPAPER"
post_typelist of stringnosee list²noyes"TEXT""post_type": "TEXT"
cluster_idstringno-nono--
meta_cluster_idstringno-nono--
tags_internallist of stringno-nono-"tags_internal": ["hasComment", "hasImage"]
tags_markinglist of stringnosee list²noyes-"tags_marking": ["important", "read"]
tags_customerlist of stringnosee⁴noyes-"tags_customer": ["tag1", "tag2"]
tags_pluginlist of stringnosee⁴noyes-"tags_plugin": ["tag1", "tag2"]
matched_querystringnonono-
matched_profilestringnonono-
imageslist of imagenoimage objectnosee below-
videoslist of videonovideo objectnosee below-
article_extended_attributesarticle_extended_attributesnoarticle_extended_attributes objectnosee below-
source_extended_attributessource_extended_attributesnosource_extended_attributes objectnosee below-
extra_article_attributesextra_article_attributesnoextra_article_attributes objectnosee below-
extra_author_attributesextra_author_attributesnoextra_author_attributes objectnosee below-
extra_source_attributesextra_source_attributesnoextra_source_attributes objectnosee below-
customer_entitieslist of customer_entitynocustomer_entities objectnosee below-
entity_urllist of entitiesnonono-"entity_url": [{"url": "pic.twitter.com/ex1"}, {"url": "https://twitter.com/ex2"}]
word_countintegerno>0nono-"word_count": 664
copyrightstringnonoyes-"copyright": "Copyright 2019, example.com, All Rights Reserved."

See the chapter on Protocols, Encodings and Value Field Options for possible values for the fields sourcetype, lang, or geo.

¹ Cannot be changed after creating a new document.

² See list of value options.

³ Deprecated fields values are not used anymore by the backend. These fields can be removed in a future release.

tags_customer: a-zA-Z0-9-_ or space, supports hierarchy using /, can only be set in project specific documents, not in general document import. tags_plugin : have to be in the form <vendor_id>_<vendor_field>:<value>

Evolution and stability of document fields

The structure of the documents will not be changed. Existing fields will not be removed and their formatting will not be changed. Occasionally, new fields will be added to the documents and the order of fields can change, please take this into account when implementing a custom client.