2023 | Vivek Nair · Wenbo Guo · Rui Wang · James F. O’Brien · Louis Rosenberg · Dawn Song
The BOXRR-23 dataset contains 4,717,215 motion capture recordings generated by 105,852 real users of extended reality (XR) devices. We sourced these recordings from several publicly available sources, converted them into a single format, combined them with additional metadata from further open-access APIs, removed identifiable user data, and packaged the recordings into a unified dataset for use by the research community. The dataset totals 4.7 TB compressed and expands to over 8.0 TB of raw data. For ease of access, we have split the data into 106 chunks, each containing up to 1,000 users, with an average size of 45 GB per chunk.
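The chunking figures above can be sanity-checked with a little arithmetic. The sketch below is only an illustration: it assumes decimal units (1 TB = 1,000 GB) and takes the rounded 4.7 TB figure at face value, so the computed average lands slightly under the quoted 45 GB.

```python
import math

# Figures quoted in the dataset description.
TOTAL_USERS = 105_852
USERS_PER_CHUNK = 1_000   # each chunk holds up to 1,000 users
COMPRESSED_TB = 4.7       # rounded figure; decimal units assumed

# Number of chunks needed if every chunk but the last is full.
chunks = math.ceil(TOTAL_USERS / USERS_PER_CHUNK)

# Approximate average compressed size per chunk, in GB.
avg_gb = COMPRESSED_TB * 1000 / chunks

print(f"chunks: {chunks}")
print(f"average chunk size: {avg_gb:.1f} GB")
```

The chunk count matches the 106 chunks stated above. The small gap between the computed average and 45 GB is within what rounding of the 4.7 TB total would explain.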
Dataset nutrition facts pursuant to the Data Nutrition Project, https://arxiv.org/abs/1805.03677
Dataset BOXRR-23
Instances Per Dataset 4,717,215
Metadata
Original Authors
Vivek Nair, UC Berkeley
Wenbo Guo, UC Berkeley
Rui Wang, UC Berkeley
James F. O'Brien, UC Berkeley
Louis Rosenberg, Unanimous AI
Dawn Song, UC Berkeley
Owner
Berkeley RDI Center
Creator
Berkeley RDI Center
Maintainer
Berkeley RDI Center
Version
2023
URL
rdi.berkeley.edu/metaverse/boxrr-23
DOI
doi.org/10.25350/B5NP4V
License
CC BY-NC-SA 4.0
Curated
APR 2023
Original Funding
National Science Foundation
National Physical Science Consortium
Fannie and John Hertz Foundation
Berkeley RDI Center
Ongoing Funding
Berkeley RDI Center
Keywords
XR, VR, AR, MR, MoCap, HCI, CGI, AI, ML
Composition
Data Dictionary
rdi.berkeley.edu/metaverse/boxrr-23/dict.json
Format
XROR
Timeframe
From
NOV 2017
To
APR 2023
Upstream Sources
BeatLeader (beatleader.xyz)
ScoreSaber (scoresaber.com)
PolyGone (polygone.art)
Steam (steampowered.com)
BeatSaver (beatsaver.com)
Source | Recordings | % of Recordings
BeatLeader | 3,525,456 | 74.7%
ScoreSaber | 1,136,581 | 24.1%
PolyGone | 55,178 | 1.2%
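The per-source shares can be re-derived from the recording counts listed above. The following is a minimal sketch; the variable names are illustrative and not part of the dataset or its tooling.

```python
# Recording counts per upstream source, as listed in the label.
counts = {
    "BeatLeader": 3_525_456,
    "ScoreSaber": 1_136_581,
    "PolyGone": 55_178,
}

# The counts should add up to the dataset's total number of recordings.
total = sum(counts.values())

# Share of each source, as a percentage rounded to one decimal place.
shares = {name: round(100 * n / total, 1) for name, n in counts.items()}

for name, share in shares.items():
    print(f"{name}: {counts[name]:,} recordings ({share}%)")
```

The three counts sum to the 4,717,215 recordings reported for the dataset, and the rounded shares agree with the percentages in the label.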
Ethics
Ethics Review
Berkeley OPHS #2023-03-16120
Human Data
Yes
Individual Data
Yes
Consent Given
Yes
Community Involvement
Yes
Sensitive Content
Maybe
Confidential Data
No
Subpopulations
Country
Restrictions
rdi.berkeley.edu/metaverse/boxrr-23/dua.pdf
Processing
Imputation
None
Manipulation
None
Completeness
Complete
Raw Data Retained
Yes
Uses and Distribution
Domains
Security and Privacy
Graphics and CGI
Human-Computer Interaction
Machine Learning
Original Use
Authentication
Notable Uses
arxiv.org/abs/2302.08927
arxiv.org/abs/2208.05604
arxiv.org/abs/2305.19198
Other Uses
Motion Synthesis
Anti-Cheating
Score Prediction
Prohibited Uses
Deanonymization
Sensitive Attributes
Health Research
Maintenance and Evolution
Corrections or Errata
None
Updates
Annual
Description
Copyright ©2022–2023 UC Regents | Email us at rdi@berkeley.edu.