Differences

This shows you the differences between two versions of the page.

Link to this comparison view

Both sides previous revisionPrevious revision
Next revision
Previous revision
Last revisionBoth sides next revision
en:results:variables [29.03.2017 07:31] – [Dwell Times] cofrinen:results:variables [02.03.2018 08:30] – [Dwell Times] admin
Line 32: Line 32:
  
  
-===== Dwell Times ===== +===== Completion Times =====
- +
-**Note:** The handling times for each page are available if the option //Download the time spent per page// has been selected in variable selection. +
- +
-**Note:** If using ''goToPage()'' or one of the questionnaire pages does not display any content, the questionnaire (technically) displays two pages on one: The original page, and the page the questionnaire immediately jumps. In this case, the handling time is counted for the first (!) processed page -- even if that page does not contribute any content.+
  
   * **TIMEnnn** The variables ''TIME001'' etc. store the time (in seconds) that a participant stayed on a questionnaire page. If the participant visits the page multiple times (e.g., using the back button) these times are summed up. Generally, dwell times are rather imprecise, because they contain loading times.   * **TIMEnnn** The variables ''TIME001'' etc. store the time (in seconds) that a participant stayed on a questionnaire page. If the participant visits the page multiple times (e.g., using the back button) these times are summed up. Generally, dwell times are rather imprecise, because they contain loading times.
Line 42: Line 38:
     * they are longer than 2 hours or     * they are longer than 2 hours or
     * they exceed the page's dwell time media by more than 3 inter quartile ranges (IQR) divided by 1.34 (equals more than 3 standard deviations in a normally distributed sample)     * they exceed the page's dwell time media by more than 3 inter quartile ranges (IQR) divided by 1.34 (equals more than 3 standard deviations in a normally distributed sample)
 +  * **TIME_RSI** An index that indicates how much faster a participant has completed the questionnaire than the typical participant (median) has done. Values above 1 identify faster respondents, values below 1 slower respondents. Details see below.
 +
 +**Note:** The parameters ''TIME_SUM'' and ''TIME_RSI'' only contain a value if the downloaded data set contains at least 10 records for the respective questionnaire ([[:en:results:troubleshooting#selection_criteria_filter]]). The more records the download contains, the more accurate the values for ''TIME_SUM'' and ''TIME_RSI'' will be, because the distribution of response times in the sample is used to clean outliers or to normalize them.
 +
 +**Note:** The response times are only included in the data set if the option to download the dwell times has been checked the //variables// selection of the download options. This option is checked by default.
 +
 +**Note:** Processing times are recorded automatically. To deactivate the recording, please uncheck the option in **Survey Project** → **Project Settings** → tab //Privacy// → //record time and duration during the survey//.
 +
  
 ===== Quality Indicators ===== ===== Quality Indicators =====
Line 53: Line 57:
     *  When using [[:en:create:selection-textinput|Free text inputs within a selection]] (single or multiple choice selection), a option's void text input (e.g., "Other: %%___%%") is only counted as invalid data, if the appropriate option in the selection was selected.     *  When using [[:en:create:selection-textinput|Free text inputs within a selection]] (single or multiple choice selection), a option's void text input (e.g., "Other: %%___%%") is only counted as invalid data, if the appropriate option in the selection was selected.
   * **MISSREL** Percentage of missing answers weighted by the other participants answering behavior. Questions that are rarely answered (e.g., voluntary text questions) are mostly irrelevant for this value, questions that most participants have answered weight worse. The linear weighting factor for a question/item is the number of answers given to this question/item divided by how often the question/item has been asked.\\ **Note:** This value may vary, depeding on the subset of data retreived.   * **MISSREL** Percentage of missing answers weighted by the other participants answering behavior. Questions that are rarely answered (e.g., voluntary text questions) are mostly irrelevant for this value, questions that most participants have answered weight worse. The linear weighting factor for a question/item is the number of answers given to this question/item divided by how often the question/item has been asked.\\ **Note:** This value may vary, depeding on the subset of data retreived.
-  * **DEG_TIME** Negative points for extremely fast completion. This value is normed in such way that values of more than 100 points indicate low-quality data. Data quality, however, is no dichotomous attribute. If you prefer a more strict filtering, a threshold of 75 or even 50 points may as well be useful as a threshold of 200 for more liberal filtering. +  * **DEG_TIME** Negative points for extremely fast completion. This value is normed in such way that values of more than 100 points indicate low-quality data. Data quality, however, is no dichotomous attribute. If you prefer a more strict filtering, a threshold of 75 or even 50 points may as well be useful as a threshold of 200 for more liberal filtering. Note, that //TIME_RSI// is a more elaborate indicator for fast responding
-  +  * **TIME_RSI** This parameter is documented in more detail in the article [[https://www.researchgate.net/publication/258997762|Too Fast, too Straight, too Weird]] (as "relative speed index"). Records with a value greater than 1.6 should be considered more closely. From a value of 2.0 on, it is very unlikely that the participant has completed the questionnaire in a meaningful way. However, knowledge questions that the participant may have to investigate can distort the value (participants with good prior knowledge are faster).
-//LASTPAGE// and //FINISHED// show if the participant dropped out early. The percentage of missing answers (//MISSING// or //MISSREL//is a valuable indicator for the participant's carefulness and for data cases that stem from "just looking". The time required to do the survey (//TIME_SUM// and ///TIME_DEG//) is an inaccurare indicator for data quality -- but it reliably identifies cases where the participants did not even read the questions+
- +
-A detailed documentation on the indicators calculation is currently available in German, only: +
-[[http://forum.onlineforschung.org/viewtopic.php?p=13790#p13790|Maluspunkte]]+
  
-Note, that the quality indicators DEG_TIME and DEG_MISS have proven non-optimal during further research (especially their mean DEGRADE). In future, more elaborate quality indicator, as described in this working paper, shall become available in SoSci Survey: [[https://www.researchgate.net/publication/258997762|Too Fast, too Straight, too Weird]] +Whether a questionnaire has been completed in its entirety can be determined using the variables //LASTPAGE// and //FINISHED// (see above). //MISSREL// is valuable indicator of the participant's diligence and for data records that originate from "just looking at". The time invested for completion is not a direct indicator of data qualitybut very short response times (//TIME_SUM// and //TIME_RSI//) indicate that the questions were not even read. 
-===== External Information =====+
  
-The following variables will only be included in the data set if the appropriate option was enabled. Further, these data is recorded only if set so in the **Project Settings** => //Privacy// options. 
  
-{{:en:results:scr.variables.externals.png?nolink|Include the variables when downloading the data set}} 
  
-  * **S_IP** IP address of the participant [REMOTE_ADDR]. This may allow inferences on the location, but is completely useless to identify people who did the questionnaire twice. 
-  * **S_LANG** The language (e.g., "en" or "de") as set in the browser [HTTP_ACCEPT_LANGUAGE].\\ **Note:** This is nothing more than a browser setting that does not necessarily indicate the user's true language or residence. 
-  * **S_REFERR** Referer -– where did the participant came from [HTTP_REFERER]? Where did he or she find the link to the survey? 
-  * **S_BROWSR** The ID sent by the browser [HTTP_USER_AGENT]. Note that the participant could easily manipulate the browser ID. 
en/results/variables.txt · Last modified: 17.03.2020 20:24 by admin
 
Except where otherwise noted, content on this wiki is licensed under the following license: CC Attribution-Share Alike 4.0 International
Driven by DokuWiki