neuroQWERTY MIT-CSXPD Dataset

The new PhysioNet website is available at: https://physionet.org. We welcome your feedback.

This data set is used and described in

L. Giancardo, A. Sánchez-Ferro, T. Arroyo-Gallego, I. Butterworth, C. S. Mendoza, P. Montero, M. Matarazzo, J. A. Obeso, M. L. Gray, R. San José Estépar. Computer keyboard interaction as an indicator of early Parkinson's disease. Scientific Reports 6, 34468; doi: 10.1038/srep34468 (2016)

When referencing this material, please include the citation above, and also include the standard citation for PhysioNet:

Goldberger AL, Amaral LAN, Glass L, Hausdorff JM, Ivanov PCh, Mark RG, Mietus JE, Moody GB, Peng C-K, Stanley HE. PhysioBank, PhysioToolkit, and PhysioNet: Components of a New Research Resource for Complex Physiologic Signals. Circulation 101(23):e215-e220 [Circulation Electronic Pages; http://circ.ahajournals.org/content/101/23/e215.full]; 2000 (June 13).

The neuroQWERTY MIT-CSXPD database contains keystroke logs collected from 85 subjects with and without parkinsons disease (PD). This dataset has been collected and analyzed in order to indicate that the routine interaction with computer keyboards can be used to detect motor signs in the early stages of PD.

Data Collection

The subjects were recruited from two movement disorder units in Madrid (Spain) following the institutional protocols approved by the Massachusetts Institute of Technology, USA (Committee on the Use of Humans as Experimental Subjects approval no. 1402006203), Hospital 12 de Octubre, Spain (no. CEIC:14/090) and Hospital Clinico San Carlos, Spain (no. 14/136-E).

Each data file collected includes the timing information collected during the sessions of typing activity using a standard word processor on a Lenovo G50-70 i3-4005U with 4MB of memory and a 15 inches screen running Manjaro Linux. Subjects were instructed to type as they normally would do at home and they were left free to correct typing mistakes only if they wanted to. The key acquisition software presented a temporal resolution of 3/0.28 (mean/std) milliseconds.

There are two datasets collected from two sets of experiments:

  1. PD_MIT-CS1PD - 31 subjects. 13 healthy controls and 18 PD sufferers. Subjects were asked to visit a movement disorder unit twice to complete the study. Therefore each subject's data is stored in 2 csv files.
  2. PD_MIT-CS2PD - 54 subjects. 30 healthy controls and 24 PD sufferers. Subjects were asked to visit a movement disorder unit once to complete the study.

Along with the raw typing collections, clinical evaluations were also performed on each subject, including UPDRS and finger tapping tests. See the referenced publication for more details.

Data Files

The data from each of the two experiment sets are split into their own subdirectories. Each dataset contains a subject summary csv file GT_DataPD_MIT-CSXPD.csv which lists for each subject:

Each keystroke data csv file has four columns which give:

The neuroQWERTY.zip file includes all of the data along with the scripts described in the next section.

Loading Scripts

The nqDataLoader.py python module contains functions used to filter anomalous results and load the data from the csv data files. The readme.ipynb ipython notebook uses these functions and demonstrates how to load and display the data.

Acknowledgements

These datasets have been collected as part of the neuroQWERTY project at the Massachusetts Institute of Technology thanks to the financial support by the Comunidad de Madrid, Fundacion Ramon Areces and The Michael J Fox Foundation for Parkinson's research (grant number 10860). We thank the M + Vision faculty for their guidance in developing this project. We also thank our many clinical collaborators at MGH in Boston, at “12 de Octubre”, Hospital Clinico and Centro Integral en Neurociencias HM CINAC in Madrid for their insightful contributions.

Icon  Name                    Last modified      Size  Description
[PARENTDIR] Parent Directory - [DIR] MIT-CS1PD/ 2016-12-20 11:07 - [DIR] MIT-CS2PD/ 2016-12-20 11:07 - [   ] readme.ipynb 2016-12-20 11:07 77K [   ] neuroQWERTY.zip 2016-12-20 11:07 2.1M [TXT] nqDataLoader.py 2016-12-20 11:07 9.8K [   ] MD5SUMS 2016-12-20 11:44 194 [   ] SHA1SUMS 2016-12-20 11:44 226 [   ] SHA256SUMS 2016-12-20 11:44 322 [   ] DOI 2017-01-05 15:56 19

Questions and Comments

If you would like help understanding, using, or downloading content, please see our Frequently Asked Questions.

If you have any comments, feedback, or particular questions regarding this page, please send them to the webmaster.

Comments and issues can also be raised on PhysioNet's GitHub page.

Updated Friday, 28 October 2016 at 16:58 EDT

PhysioNet is supported by the National Institute of General Medical Sciences (NIGMS) and the National Institute of Biomedical Imaging and Bioengineering (NIBIB) under NIH grant number 2R01GM104987-09.