DATA

4.71 TB

USERS

105,852

RECORDINGS

4,717,215

COUNTRIES

50+

XR DEVICES

30+

FORMAT

.XROR

2023  |  Vivek Nair · Wenbo Guo · Rui Wang · James F. O’Brien · Louis Rosenberg · Dawn Song

The BOXRR-23 dataset contains 4,717,215 motion capture recordings generated by 105,852 real users of extended reality (XR) devices. We sourced these recordings from a number of broadly publicly-available sources, converted them all into a single format, combined them with additional metadata from further open-access APIs, removed identifiable user data, and finally packaged the recordings into a unified dataset for use by the research community. The dataset totals 4.7 TB in compressed size, and expands to over 8.0 TB of raw data. For ease of access, we have split the data into 106 chunks, each containing up to 1,000 users, with an average size of 45 GB per chunk.

As seen in:

  • Vivek Nair, Wenbo Guo, Justus Mattern, Rui Wang, James F. O’Brien, Louis Rosenberg, and Dawn Song. "Unique Identification of 50,000+ Virtual Reality Users from Head & Hand Motion Data." arXiv, 17 February 2023. doi:10.48550/arXiv.2302.08927.
  • Vivek Nair, Gonzalo Munilla Garrido, and Dawn Song. "Going Incognito in the Metaverse." arXiv, 11 August 2022. doi:10.48550/arXiv.2208.05604.
  • Vivek Vivek Nair, Christian Rack, Wenbo Guo, Rui Wang, Shuixian Li, Brandon Huang, Atticus Cull, James F. O'Brien, Louis Rosenberg, and Dawn Song. "Inferring Private Personal Attributes of Virtual Reality Users from Head and Hand Motion Data." arXiv, 30 May 2023. doi:10.48550/arXiv.2305.19198.

Supported by:

Berkeley RDI Logo NSF Logo NPSC Logo Hertz Foundation Logo

BOXRR-23 is used by dozens of institutions, including:

University of California, Berkeley Dongguk University Korea Advanced Institute of Science & Technology NC State University University of North Carolina at Pembroke Sungkyunkwan University Universität Würzburg

Dataset nutrition facts pursuant to the The Data Nutrition Project, https://arxiv.org/abs/1805.03677

Dataset Facts


Dataset BOXRR-23

Instances Per Dataset 4,717,215


Metadata


Original Authors

Vivek Nair, UC Berkeley

Wenbo Guo, UC Berkeley

Rui Wang, UC Berkeley

James F. O'Brien, UC Berkeley

Louis Rosenberg, Unanimous AI

Dawn Song, UC Berkeley


Owner

Berkeley RDI Center


Creator

Berkeley RDI Center


Maintainer

Berkeley RDI Center


Version

2023


URL

rdi.berkeley.edu/metaverse/boxrr-23


DOI

doi.org/10.25350/B5NP4V


License

CC BY-NC-SA 4.0


Curated

APR 2023


Original Funding

National Science Foundation

National Physical Science Consortium

Fannie and John Hertz Foundation

Berkeley RDI Center


Ongoing Funding

Berkeley RDI Center


Keywords

XR, VR, AR, MR, MoCap, HCI, CGI, AI, ML


Composition


Data Dictionary

rdi.berkeley.edu/metaverse/boxrr-23/dict.json


Format

XROR


Timeframe

    From

    To

 

NOV 2017

APR 2023


Upstream Sources

BeatLeader (beatleader.xyz)

ScoreSaber (scoresaber.com)

PolyGone (polygone.art)

Steam (steampowered.com)

BeatSaver (beatsaver.com)


Source

% of Recordings


BeatLeader 3,525,456 recordings

ScoreSaber 1,136,581 recordings

PolyGone 55,178 recordings

74.7%

24.1%

1.2%

Ethics


Ethics Review

Berkeley OPHS #2023-03-16120


Human Data

Yes


Individual Data

Yes


Consent Given

Yes


Community Involvement

Yes


Sensitive Content

Maybe


Confidential Data

No


Subpopulations

Country


Restrictions

rdi.berkeley.edu/metaverse/boxrr-23/dua.pdf


Processing


Imputation

None


Manipulation

None


Completeness

Complete


Raw Data Retained

Yes


Uses and Distribution


Domains

Security and Privacy

Graphics and CGI

Human-Computer Interaction

Machine Learning


Original Use

Authentication


Notable Uses

arxiv.org/abs/2302.08927

arxiv.org/abs/2208.05604

arxiv.org/abs/2305.19198


Other Uses

Motion Synthesis

Anti-Cheating

Score Prediction


Prohibited Uses

Deanonymization

Sensitive Attributes

Health Research


Maintenance and Evolution


Corrections or Erratum

None


Updates

Annual


Description


The BOXRR-23 dataset contains 4,717,215 motion capture recordings generated by 105,852 real users of extended reality (XR) devices, obtained from three broadly publicly-available sources relating to two XR applications, Beat Saber and Tilt Brush.

Copyright ©2022–2023 UC Regents  |  Email us at rdi@berkeley.edu.