What Do You See in A Face?

An online experiment by Sonja van Kerkhoff
7 July - 22 August 2008
Thanks to Alex Reuneker and to Antanas Kaziliunas
for their technical expertise in creating the database.

Screenshot from a online questionnaire by Sonja van Kerkhoff
Screenshot from mid July 2008 looking at the final two responses.
Thanks to the 371 individuals who responded to this questionnaire, plus the 21 who helped me in testing this.
>> Go direct to the conclusions of this experiment.

Measuring Engagement
"...it appears to me that wonder is the first of all the passions; and it has no opposites, because if the object which presents itself has nothing in it that surprises us, we are in no wise moved regarding it, and we consider it without passion”
Rene Descartes, The Passions of the Soul, article 53


[cited: page 1, New Media, Old Media: A History and Theory Reader (2006) edited by Hui Kyong Chun, Wendy and Thomas W. Keenan.]



Rosalind Picard argues that engagement relates to our emotions which are cognitive and emotional, adding that emotion theorists still do not agree even on a definition of emotion. However, using the example of not being able to define whether Mt Everest is rock or ice, Picard argues that "we can base solid facts and knowledge on structures that are themselves imprecisely defined."

[page 21, Affective Computing (1997) by Rosalind Picard]



For the purpose of measuring engagement in a practical experiment I have chosen to focus on the time spent in a system and the observed interactivity or responsiveness of the user or viewer. The assumption here is that the further a person continues to interact with this questionnaire, the more engaged the user is, because they receive result related feedback after each question and they then must choose to stop or continue.


  Screenshot from a online questionnaire by Sonja van Kerkhoff
First page of the questionnaire: the introduction.


What do you see in a face?

To test my hypothesis that socially charged content is a significant factor in engagement, I created an online questionnaire based on the Myers- Briggs/Jung typology,

[see: humanmetrics.com for an explanation of what this is.]

a widely used personality test.
This was used so that the questionnaire would appear credible to the users, although the experiment is not in fact intended to test personality, but rather to see how long users continue to answer questions with one of the four different questionnaire types.

The first page also recorded the user's gender and age group, although this information was not used for the purposes of my questionnaire.

Each respondent randomly saw one of the four types of questionnaire, unaware that there were three other types.
To ensure this, IP addresses were recorded and respondents were not able to repeat the questionnaire. In doing this, individuals who shared the same IP address such as differing computers in a household or office, meant that no one could easily see the actual questionnaire twice.


   Screenshot from a online questionnaire by Sonja van Kerkhoff
Figure 1: The next page, Question 1 of questionnaire type 1.

  Screenshot from a online questionnaire by Sonja van Kerkhoff
Question 1 of Questionnaire type 4.



The title and question on each of the 72 pages of each questionnaire asks “What do you see in a face” and on one side there is either a drawing or a photograph of a face, with a caption which is 'socially charged' or 'neutral'.

*Type 1 (see Figure 1) each page showed a photograph and a 'socially charged' caption: a caption which identified the person and said something significant about them or their life.

*Type 2 showed the same photograph with a 'neutral' caption, approximately the same length as the matching 'socially charged' caption.

* Type 3 showed a sketch of the same face with the same 'socially charged' caption as type 1

* Type 4 (see Figure 2) showed a sketch of the face with the matching 'neutral' caption.


The purpose was to measure the effect of socially charged verses neutral captions.

To get an idea of the actual (and not merely statistical) significance of any effect found, two design variants (photograph or sketch) were also used, so that it would be possible to say that socially charged content is – for example – twice as effective as a change in design, or half as effective.

The two image variants also serve as a check: if only photographs were used, and respondents were found to quit the questionnaire in large numbers at – for example – question 25, this might be because that photograph was particularly offensive. It might then have been necessary to rerun the experiment without that photograph.
No markedly anomalous effects for individual questions were in fact found, but the photograph-sketch variants still gave a yardstick of how sensitive the experiment was to small differences in the questionnaire.

Screenshot from a online questionnaire by Sonja van Kerkhoff
Screenshot from a online questionnaire by Sonja van Kerkhoff

Left: Questionnaire Type 2, a photograph with a 'neutral' caption.   Right: After choosing continue in the previous screen, the participant sees their answer in proportion with all other answers so far. There is no indication of how many questions there are to go.

[Screenshots taken on 13 August 2008]

Screenshot from a online questionnaire by Sonja van Kerkhoff
Screenshot from a online questionnaire by Sonja van Kerkhoff

Left: The next screen after 'stop' has been pressed.
Right: The screen after this is confirmed shows all the answers up to 72 with the participant's answers marked in orange text up to the question they stopped at. Here you see the answers for questions 1 + 2.

[Screenshots taken on 13 August 2008]

Screenshot from a online questionnaire by Sonja van Kerkhoff
Screenshot from a online questionnaire by Sonja van Kerkhoff

Left: Detail of the final page for Types 3 and 4.   Right: Detail of the final page for Types 1 and 2.
No texts are in orange indicating that these are questions this participant didn't get to before stopping.
The individuals are: 49: Julia Roberts, 50: George W Bush and 51: Mrs. Imelda Marcos who were named in the 'socially charged' captions only.

[Screenshots are from 12th of July 2008.]

  Screenshot from a online questionnaire by Sonja van Kerkhoff
Type 4 had the following caption:
"She was born in Melbourne in 1939. Her father was a leading Australian insurance
executive. She has a degree in English and French language and literature."

Type 3 had the following caption:
"Germaine Greer's ideas have created controversy ever since her ground-breaking
book The Female Eunuch became an international best-seller in 1970."

  Screenshot from a online questionnaire by Sonja van Kerkhoff
Type 4 had the following caption:
" In primary school he received a prize for English and did well at sports
including cricket and boxing. He was a tall child."

Type 3 had the following caption:
"John Cleese performed in a Monty Python's Flying Circus sketch called
"The Semaphore Version of Wuthering Heights," "

  Screenshot from a online questionnaire by Sonja van Kerkhoff
Type 4 had the following caption:
"Later on she had three children and sang to them."

Type 3 had the following caption:
"Diana Ross recording the song "We Are The World." "

  Screenshot from a online questionnaire by Sonja van Kerkhoff
Type 4 had the following caption:
"He was born in Portland, Oregon USA. and grew up there, as the middle child.
His mother, Margaret, was a teacher before the birth of her children."

Type 3 had the following caption:
"Matthew Groening creator of The Simpsons, named the family after his own parents
and sisters—while Bart was an anagram of the word brat."

  Screenshot from a online questionnaire by Sonja van Kerkhoff
Type 4 had the following caption:
"She was born in England and is two years old."

Type 3 had the following caption:
"The future queen of England at age two in 1928."

  Screenshot from a online questionnaire by Sonja van Kerkhoff
Type 4 had the following caption:
"He was known to be an active sportsman from the time of his youth, playing rugby
and cricket. He was an avid golfer into his nineties."

Type 3 had the following caption:
"King Malietoa Tanumafili II, the world's 3rd longest-reigning monarch,
died in Samoa at 94 years of age in 2007."

  Screenshot from a online questionnaire by Sonja van Kerkhoff
Type 4 had the following caption:
"A woman dressed in Victorian dress in the 1890s. She worked with her brothers on a
committee and was articulate. She lived in a temperate climate."

Type 3 had the following caption:
"Meri Te Tai Mangakahia, member of the Maori Unity Party where women had equal
voting rights.The Maori Parliament was established in New Zealand in 1890."


The test was to measure how many questions were answered before the visitor left the system, so it was important that the questionnaire should appear credible, and that it was easy to leave the system at any moment.

Initially I tested this questionnaire on seven people in three countries, working in 3 operating systems, aiming to make the questionnaire long enough so as to be a little demanding to complete but not so long that it would lose credibility.

Through trial and error I settled on 72 faces, half being of females, and of known personalities in diverse fields or cultural backgrounds.

Because I relied on six types of user audiences in the e-mail lists for my participants I chose 'faces' known in the film world, art world, media / computer worlds, education, literature, music, New Zealand and Bahai worlds.

I invited feedback on the questionnaire in the invitation I emailed out.

I recorded the length of time (from question 1 until they clicked 'ok' after 'stop'), the final question they reached, the IP address, the type of questionnaire, gender, and age range.

I received responses from ip addresses from around the world (below I give a breakdown but didn't refer to this in my thesis).

For my graduation thesis I used results from between July 7th and August 22nd [not counting the participants I'd asked to check the questionnaire for its usability]. So I used 347 responses: 85, 94, 88 and 80 for types 1 to 4 respectively.

I received feedback related to the content of the questionnaire from 18 respondents, many of whom I knew personally. Most made responses in relation to the content of the questionnaire or in relation to the results they saw in relation to their own answers.
A few made the comment that they found it difficult to make a choice, not knowing if they should respond to the face or the captions.

To see if this was a significant factor, I added types 5 and 6 on July 19th.
These had photographs (type 5) or drawings (type 6) with no captions, and were intended to see whether the mere existence of captions was enough to change participant's behaviour.
I stopped this on August 22nd and had 24 responses in both types.
These few results indicated a pattern similar to the other types with six stopping at question 9 or less, and five finishing at question 72 with the rest spread thinly in between. The number of responses was too small for the numerical analysis applied to types 1 to 4.

I also observed 8 individuals in the U.K. undertaking the questionnaire in early August and found that all read the captions before making a choice. I noted that some stopped because they had to do something else, and not because they wished to leave. I had told them that part of my test was to see when they would leave the system, so that they wouldn't feel they had to complete the questionnaire because of my presence.

However given that all of these individuals as well as any person who did the questionnaire were allotted to the types randomly, this shouldn't effect the results between the types of questionnaire.

For all the results, I also recorded the length of time each visitor spent answering the questions, but chose not to use this because the number of questions answered is a more valid proxi measure for engagement. The time on-line could be affected by various events happening around each participant.




Demographic spread of those who responded

Added here for your interest. These results not used in my thesis experiment.

[7 July - 22 August 2008]

Aotearoa|New Zealand    24
Austria     5
Australia    11
Belgium       5
Brazil       1
Canada       7
Chile       1
China       5
Colombia       1
Croatia       1
Ecuador       1
Egypt       1
Figi       1
Finland       1
France       2
French Guiana       1
Germany      10
Greece       2
Hungary       1
India       1
Ireland       2
Italy       3
Liechtenstein     1
Malaysia     1
The Netherlands    196
North Korea     1
Norway     1
Poland     1
Romania     1
Slovakia       1
South Africa       1
South Korea     1
Sweden       2
Switzerland     1
Turkey       3
U.K.      27
U.S.A.      47

The table shows a total of 371 responses, including types 5 + 6 [not counted in my final analysis]. Roughly a third were affiliated to universities and it seems that from my network, those in the "Aotearoa Digital Arts" email list, a New Zealand folk list, and Bahai circles responded or possibly passed on my questionnaire the most.
The respondents in the Netherlands were spread across the country indicating to me that Bahais most likely spread this onto their acquaintances. The city or province, business or organization was also indicated in all of these addresses.



  Screenshot from a online questionnaire by Sonja van Kerkhoff


Results

The responses for each questionnaire were ranked according to the number of questions answered.

The most striking difference between the questionnaires was that those with a sketch achieved greater engagement (measured by the final question answered) than those with a photograph.


 With  Photo- graphs With  Sketches
Number who stopped at Question 64 or higher2229



I was surprised to discover in the questionnaire that sketches had such a significantly positive influence on the engagement, while the “socially charged” captions had a modest positive effect on median scores, and did not significantly affect the numbers who continued to the very end of the questionnaire.
And it appears that socially charged content may well be a factor in achieving greater sustained user engagement (explained below), although it is considerably less important than design features.


Detail from a online questionnaire by Sonja van KerkhoffDetail from a online questionnaire by Sonja van KerkhoffDetail from a online questionnaire by Sonja van KerkhoffDetail from a online questionnaire by Sonja van Kerkhoff
Socially charged caption:
q. 61:
In 1965 Stephanie Kwolek invented Kevlar, a synthetic fiber used in bullet-proof vests, airplanes, ropes, boats, cables, tires, etc.
Neutral caption:
q. 61:
From her mother, first a homemaker and then by necessity a career woman, she inherited a love of fabrics and sewing.
Socially charged caption:
q. 14:
Tim Berners-Lee, who invented the World Wide Web, poses in his MIT office.
Oct. 2, 1995
www.time.com/time
Neutral caption:
q. 14:
He has just gotten up and is about to have some breakfast. Then he will go to work.
The date is October 2nd, 1995.



I broke the results into groups of nine (so questions 64-72 are the final nine questions) to make this more manageable and because I noted marked changes in responses from question 9 onwards not only in numbers but in distinctions between the types.
The table below uses 371 responses [347 for just Types 1 to 4.] from between July 7th and August 22nd 2008.

Final Question Answered  Type 1           
 Photo + socially  charged caption
 Type 2           
 Photo + neutral  caption
 Type 3           
 Sketch +
 socially  charged caption
 Type 4            
 Sketch +
 neutral caption
 Type 5            
 Photo,
 no text
 Type 6            
 Sketch,
 no text
 Total 
1 - 9  23 22 27 22 2 4 100
10 - 18  14 9 11 13 0 4 51
19 - 27  12 23 14 12 1 4 66
28 - 36  10 11 6 4 0 2 33
37 - 45  7 4 2 4 0 0 17
46 - 54  4 7 8 9 0 0 28
55 - 63  6 5 3 4 2 0 20
64 - 72  9 13 17 12 3 2 56
Total  85 94 88 80 8 16 371



The results for 24 respondents who left the system without answering the first question were discarded, on the assumption that these were accidental participants who left because they did not mean to be there in the first place: their leaving would not relate to the type of questionnaire they had encountered.
For each questionnaire, the median of the responses was found: if for example there were 81 responses, the number of questions answered by the 40th ranking respondent was taken as the engagement score for that type of questionnaire. If there were 80 responses, the average of the questions answered by the 39th and 40th respondent would be the score. This gave the following results:

For Questions 1-72  Median for the final question answered for charged contentMedian for the final question answered for neutral content
Photograph 23  24 
Sketches 24.5  23 



These differences are not significant, given that the number of questions answered is necessarily a whole number.
It is notable that the engagement for the photograph and sketch variants also do not differ, although the distribution of the most tenacious respondents (22 versus 29, described above) indicates that this is a large factor.

This suggested that uninterested respondents may click on a number questions before leaving, rather than just one, the number not relating to the content or design of the questionnaire. If we discard all the responses which stopped at or before question 9, the picture is quite different:

For Questions 10-72  Median for the final question answered for charged contentMedian for the final question answered for neutral content
Photograph 32  40 
Sketches 30  36 



Here it appears that the engagement scores for the versions with sketches are higher than those with photographs, but also that the two scores for the versions with charged content are both higher than those for the same design but with neutral content.

I then recorded the number of responses for those stopping between question 1 and 15

Final Question Answered   1     2      3      4      5      6      7      8      9     10    11    12    13    14    15  
Type 1 431421421420202
Type 2 221432125220002
Type 3 733511721112221
Type 4 102544213321111
Total 148718108137101073536



There is a marked drop off in responses from Question 11 onwards, so to test the sensitivity of this analysis,
I did the same analysis as above but started with results for Question 16 onwards which gave the following results.

For Questions 16-72  Median for the final question answered for charged contentMedian for the final question answered for neutral content
Photograph 38  32 
Sketches 50  47 


These results show a similar pattern as the Table showing the median for Questions 10-72.
It appears that socially charged content may well be a factor in achieving greater sustained user engagement, while neither design aspects nor captions had a detectable effect on initial engagement.


If we return to the most tenacious respondents, those who stopped at Question 64 or higher, where 22 had answered questionnaires using photographs, and 29 had answered questionnaires with sketches. The breakdown for socially charged versus neutral content for the same 51 respondents is, surprisingly, almost the same:

  Socially charged captions  
(Type 1 + 3) 
  Neutral captions
(Type 2 + 4)
Number who stopped at Question 64 or higher   26  25 



The higher median scores for socially charged captions are not reflected in this last grouping. Why not? One answer is that 51 respondents is a fairly small group to give results. This is the advantage of using median scores: they show the engagement of all the participants included rather than of the most engaged group.
But another answer is that socially charged content gives greater engagement in the middle range – ignoring the results of the first 9 questions – but for really sustained engagement, to the bitter end, use Art! The drawings were much more effective than the photographs (29 vs 22).

Detail from a online questionnaire by Sonja van KerkhoffDetail from a online questionnaire by Sonja van KerkhoffDetail from a online questionnaire by Sonja van KerkhoffDetail from a online questionnaire by Sonja van Kerkhoff
Socially charged caption:
q. 52:
Dizzy Gillespie, a Bahá'í since 1970, was almost inseparable from his bent trumpet.
Neutral caption:
q. 52:
He enjoyed a good joke, made people happy, and liked to have fun on the job.
Socially charged caption:
q. 35:
The deep-voiced and statuesque, Phyllis Hyman sang with an emotional intensity rare for female vocalists.
Neutral caption:
q. 35:
She had a lot of energy and an extraordinaire sense of presence along with a no-nonsense attitude.

Summary + Conclusions

I was surprised to discover in the questionnaire that sketches had such a significantly positive influence on the engagement, while the “socially charged” captions had a modest positive effect on median scores, and did not significantly affect the numbers who continued to the very end of the questionnaire. The sketches, in general, were more or less copies of the photograph, so I considered this a minor change in the design of the system.
This indicates that this questionnaire method is quite sensitive to variants in the questionnaire.

The much better scores for the sketches confirm that mediation – the visibility of the medium as opposed to its invisibility – is a positive factor. Put the other way, a more realistic simulation could have a negative effect on engagement. Perhaps, as imaginative beings and symbol-users, we actually like filling in the gaps and seeing the hand of the maker in the signs presented to us?

I also had expected that initial curiosity would result in a drop out after one or two questions, but the results showed that this did not occur until about question 9. This could be due to the aesthetics of the questionnaire or the interest people might show towards close-ups of faces. I came across a few comments in the literature suggesting that the general public today reads more visually (via videos, film or poster design) than in the pre-digital age, and this is a topic that could be pursued in the future.

Detail from a online questionnaire by Sonja van KerkhoffDetail from a online questionnaire by Sonja van KerkhoffDetail from a online questionnaire by Sonja van KerkhoffDetail from a online questionnaire by Sonja van Kerkhoff
Socially charged caption:
q. 20:
Years later, Mahatma Gandhi, was a political and spiritual leader of India and the Indian independence movement.
Neutral caption:
q. 20:
He studied law and then worked as a barrister in England. He was married with one son at the time this photograph was taken.
Socially charged caption:
q. 45:
Sonja van Kerkhoff at age 7
at her First Holy Communion in Manaia, Taranaki, New Zealand.
Neutral caption:
q. 45:
She needed to wear glasses from an early age. She likes to draw and grew up on a dairy farm.
Would he prefer:
an unplanned event that was familiar
an unplanned event that was unfamiliar
Which of the following would she find more of a compliment?
That is a very sentimental person
That is a very logical person


Potential Ramifications

The questionnaire showed there is some significance in relation to charged or meaningful content when it comes to (relatively) sustained engagement. This was what I was interested in: an engagement that goes beyond user interaction. I chose to test an aspect relating to content because my critique in the development of interactive media was not focused on issues of interactivity or even immersiveness per se, but on purpose, content and emotional effect. This is in part a reaction to the focus on technique that seems to be a popular trend among practitioners in digital arts worlds. It seems to be me almost an uncritical approach. It is our subjectivity that makes things interesting and sustains engagement.

My masters thesis, the context for this questionnaire, is here >> Engaging Media: issues of engagement in digital media art projects (26 page PDF, 930kb) (This opens in a new window)



>> Sonja's c.v.    Written by Sonja   work by Sen + Sonja    Latest Art Work    Find by Medium