|
ABSTRACT
We describe a sparse Bayesian regression method for recovering 3D human body motion directly from silhouettes extracted from monocular video sequences. No detailed body shape model is needed, and realism is ensured by training on real human motion capture data. The tracker estimates 3D body pose by using Relevance Vector Machine regression to combine a learned autoregressive dynamical model with robust shape descriptors extracted automatically from image silhouettes. We studied several different combination methods, the most effective being to learn a nonlinear observation-update correction based on joint regression with respect to the predicted state and the observations. We demonstrate the method on a 54-parameter full body pose model, both quantitatively using motion capture based test sequences, and qualitatively on a test video sequence.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
Agarwal, A., & Triggs, B. (2004a). 3D Human Pose from Silhouettes by Relevance Vector Regression. Int. Conf. Computer Vision & Pattern Recognition.
|
| |
2
|
Agarwal, A., & Triggs, B. (2004b). Tracking Articulated Motion with Piecewise Learned Dynamical Models. European Conf. Computer Vision.
|
| |
3
|
Athitsos, V., & Sclaroff, S. (2000). Inferring Body Pose without Tracking Body Parts. Int. Conf. Computer Vision & Pattern Recognition.
|
| |
4
|
Athitsos, V., & Sclaroff, S. (2003). Estimating 3D Hand Pose From a Cluttered Image. Int. Conf. Computer Vision.
|
| |
5
|
|
| |
6
|
|
| |
7
|
|
| |
8
|
|
| |
9
|
D'Souza, A., Vijayakumar, S., & Schaal, S. (2001). Learning Inverse Kinematics. Int. Conf. on Intelligent Robots and Systems.
|
| |
10
|
|
| |
11
|
Howe, N., Leventon, M., & Freeman, W. (1999). Bayesian Reconstruction of 3D Human Motion from Single-Camera Video. Neural Information Processing Systems.
|
| |
12
|
|
| |
13
|
|
| |
14
|
|
| |
15
|
Ormoneit, D., Sidenbladh, H., Black, M., & Hastie, T. (2000). Learning and Tracking Cyclic Human Motion. Neural Information Processing Systems (pp. 894--900).
|
| |
16
|
Pavlovic, V., Rehg, J., & MacCormick, J. (2000). Learning Switching Linear Models of Human Motion. Neural Information Processing Systems (pp. 981--987).
|
| |
17
|
|
| |
18
|
|
| |
19
|
|
| |
20
|
Sminchisescu, C., & Triggs, B. (2003). Kinematic Jump Processes For Monocular 3D Human Tracking. Int. Conf. Computer Vision & Pattern Recognition.
|
| |
21
|
|
| |
22
|
Taylor, C. (2000). Reconstruction of Articulated Objects from Point Correspondances in a Single Uncalibrated Image. Int. Conf. Computer Vision & Pattern Recognition.
|
| |
23
|
Tipping, M. (2000). The Relevance Vector Machine. Neural Information Processing Systems.
|
| |
24
|
|
| |
25
|
|
CITED BY 5
|
|
|
|
|
|
|
|
|
|
|
|
|
|
David A. Forsyth , Okan Arikan , Leslie Ikemoto , James O'Brien , Deva Ramanan, Computational studies of human motion: part 1, tracking and motion synthesis, Foundations and Trends® in Computer Graphics and Vision, v.1 n.2, p.77-254, July 2006
|
|