Skip to main content

Demographics

Gender

This diagram shows the rpartition of authors for all documents which match the project criterias within the filtered period by gender when it can be determined:

Gender

In this sample, over the filtered period (between the 1st of October and the 8th of october), we got 23.7% of the document (where the gender is known) written by women, 76.3% by men.

To retrieve this result, we can use gender histogram:

curl -L -X GET 'https://api.talkwalker.com/api/v1/search/p/<project_id>/histogram/gender?access_token=<access_token>&q=published:>1633039200000 AND published:<1633644000000'

Gender - API result

note

The results come with the gender UNKOWN when the gender cannot be determined.

To retrieve percentages, we should apply this formula:

Percent of women = #FEMALE / (#FEMALE + #MALE)
Percent of men = #FEMALE / (#FEMALE + #MALE)

In our sample :

  • v is the number of documents that match the filter criteria and the gender. 397 / (397 + 1277) = 397 / 1674 = 0.2371 = 23.7% of women.
  • ks is the gender.
  • total_hits: Total number of documents match the filter criteria.

Age

This diagram shows the repartition of authors for all documents which match the project criterias within the filtered period by age when it can be determined:

Age

In this sample, over the filtered period (between the 1st of October and the 8th of october), we got 136 of the document (where the age is known) where authors are between 18 and 24 years old.

To retrieve this result, we can use age histogram:

curl -L -X GET 'https://api.talkwalker.com/api/v1/search/p/<project_id>/histogram/age?access_token=<access_token>&q=published:>1633039200000 AND published:<1633644000000'

Age - API result

In our sample :

  • v is the number of documents that match the filter criteria and the age. Here we have 136 documents where the authors are between 18 and 24 years old.
  • ks is the age group.
  • total_hits: Total number of documents match the filter criteria.

Top countries

This diagram shows the repartition of source sites for all documents which match the project criterias within the filtered period by country:

Top countries

In this sample, over the filtered period (between the 1st of October and the 8th of october), we got 263 documents published on a site located in UK.

To retrieve this result, we can use country histogram:

curl -L -X GET 'https://api.talkwalker.com/api/v1/search/p/<project_id>/histogram/country?access_token=<access_token>&q=published:>1633039200000 AND published:<1633644000000'

Top countries - API result

In our sample :

  • v is the number of documents that match the filter criteria and the country. Here we have 263 documents published on a site located in the UK.
  • ks is the country code (ISO 2 letters country codes).
  • total_hits: Total number of documents match the filter criteria.

Top languages

This diagram shows the repartition of the languages used in all documents which match the project criterias within the filtered period:

Top languages

In this sample, over the filtered period (between the 1st of October and the 8th of october), we got 28 documents written in German.

note

By default, if the language is not detected on the document, it will be English.

To retrieve this result, we can use language histogram:

curl -L -X GET 'https://api.talkwalker.com/api/v1/search/p/<project_id>/histogram/language?access_token=<access_token>&q=published:>1633039200000 AND published:<1633644000000'

Top languages - API result

In our sample :

  • v is the number of documents that match the filter criteria and the country. Here we have 28 documents written in German.
  • ks is the language code (ISO 2 letters code).
  • total_hits: Total number of documents match the filter criteria.

Top family status

This diagram shows the repartition of the authors' family status for all documents which match the project criterias within the filtered period when it can be determined:

Top Family Status

In this sample, over the filtered period (between the 1st of October and the 8th of october), we got 16 documents written by married authors.

Warning

By API, it’s not possible to get it in one call. You need to make a call for each family status.

To retrieve this result, we can use published histogram with the parameter q setted to demographic filter metric you want to retrieve. It can be one of these values: familystatus-parents, familystatus-single, familystatus-married or familystatus-senior

curl -L -X GET 'https://api.talkwalker.com/api/v1/search/p/<project_id>/histogram/published?access_token=<access_token>q=demographic:familystatus-married&interval=2w&min=1633039200000&max=1633644000000'

Top Family Status - API result

In our sample :

  • t is the timestamp (beginning of the period).
  • v is the number of published documents which match the filter criterias in the filtered period and the demographic filter. we got 16 documents written by married authors (when it can be determined) in the period.

Top interests

This diagram shows the repartition of the authors' interests for all documents which match the project criterias within the filtered period when it can be determined:

Interests

In this sample, over the filtered period (between the 1st of October and the 8th of october), we got 48 documents where authors have an interest on "Music & Audio".

To retrieve this result, we can use interest histogram:

curl -L -X GET 'https://api.talkwalker.com/api/v1/search/p/<project_id>/histogram/interest?access_token=<access_token>&q=published:>1633039200000 AND published:<1633644000000'

Interests - API result

In our sample :

  • v is the number of documents that match the filter criteria and the interest code. Here we have 48 documents where the authors are interested in music & audio (when it can be determined).
  • ks is the interest code.
  • total_hits: Total number of documents match the filter criteria.

Top occupations

This diagram shows the repartition of the authors' occupations for all documents which match the project criterias within the filtered period when it can be determined:

occupations

In this sample, over the filtered period (between the 1st of October and the 8th of october), we got 28 documents where authors were identified as "Executive manager".

To retrieve this result, we can use occupation histogram:

curl -L -X GET 'https://api.talkwalker.com/api/v1/search/p/<project_id>/histogram/occupation?access_token=<access_token>&q=published:>1633039200000 AND published:<1633644000000'

occupations - API result

In our sample :

  • v is the number of documents that match the filter criteria and the occupation code. Here we have 28 documents where the authors are executive managers (when it was determined).
  • ks is the occupation code.
  • total_hits: Total number of documents match the filter criteria.

Share of individual/business

This diagram shows the repartition of the authors' type (individual or company) for all documents which match the project criterias within the filtered period when it can be determined:

Share of individual/business

In this sample, over the filtered period (between the 1st of October and the 8th of october), we got 88 documents from authors where the type was identified as "company".

To retrieve this result, we can use published histogram with the parameter q setted to author's type you want to retrieve. It can be one of these values: individual, business.

curl -L -X GET 'https://api.talkwalker.com/api/v1/search/p/<project_id>/histogram/published?access_token=<access_token>&q=authorcategory:business&interval=2w&min=1633039200000&max=1633644000000'

Share of individual/business - API result

In our sample :

  • t is the timestamp (beginning of the period).
  • v is the number of published documents which match the filter criterias in the filtered period and the autjorcategory filter. we got 88 documents written by authors identified as company.

Share of automated accounts

This diagram shows the number of documents which match the project criterias within the filtered period and when the post is identified as posted from an automatic account.

Share of automated accouts

In this sample, over the filtered period (between the 1st of October and the 8th of october), we got 71 documents from "automated" accounts.

To retrieve this result, we can use published histogram with the parameter q setted with the tag is:automated_post or not.

curl -L -X GET 'https://api.talkwalker.com/api/v1/search/p/<project_id>/histogram/published?access_token=<access_token>&q=is:automated_post&interval=2w&min=1633039200000&max=1633644000000'

Share of automated accouts - API result

In our sample :

  • t is the timestamp (beginning of the period).
  • v is the number of published documents which match the filter criterias in the filtered period and the tag. we got 71 documents tagged as an automated post.