A method for forming a 3-D facial model obtains a reconstructed radiographic image volume of a patient and extracts a soft tissue surface of the patients face from the image volume and forms a dense point cloud of the extracted surface. Reflection images of the face are acquired using a camera, wherein each reflection image has a different corresponding camera angle with respect to the patient. Calibration data is calculated for one or more of the reflection images. A sparse point cloud corresponding to the reflection images is formed by processing the reflection images using multi-view geometry. The sparse point cloud is registered to the dense point cloud and a transformation calculated between reflection image texture data and the dense point cloud. The calculated transformation is applied for mapping texture data from the reflection images to the dense point cloud to form a texture-mapped volume image that is displayed.