The present disclosure pertains to an encoding device and encoding method, and a decoding device and decoding method, that make it possible to obtain two-dimensional image data and depth image data for a viewpoint corresponding to a prescribed display image generation method, regardless of the viewpoint at the time of image capture. From three-dimensional data of an imaging subject, itself generated from two-dimensional image data for a plurality of viewpoints, a conversion unit generates two-dimensional image data for a plurality of viewpoints corresponding to the prescribed display image generation method, together with depth image data expressing, for each pixel, the position of the imaging subject in the depth direction. An encoding unit encodes the two-dimensional image data and depth image data generated by the conversion unit. A transmission unit transmits the two-dimensional image data and depth image data encoded by the encoding unit. This disclosure is applicable, for example, to an encoding device or th
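
The conversion step described above can be illustrated with a minimal sketch. It is not the disclosed implementation: it assumes the three-dimensional data is a colored point cloud, that each target viewpoint is an unrotated pinhole camera looking along +z, and that a simple z-buffer resolves occlusion. The names `Camera`, `render_view`, and `render_views` are hypothetical, and the encoding and transmission units are omitted.

```python
import math
from dataclasses import dataclass
from typing import List, Optional, Tuple

Point = Tuple[float, float, float]
Color = Tuple[int, int, int]


@dataclass
class Camera:
    """Hypothetical target viewpoint: pinhole camera looking along +z, no rotation."""
    position: Point
    focal_px: float  # focal length expressed in pixels (assumed parameter)
    width: int
    height: int


def render_view(points: List[Point], colors: List[Color], cam: Camera):
    """Project colored 3D points into one viewpoint.

    Returns (image, depth): image[v][u] is an RGB tuple and depth[v][u] is the
    distance along the camera's z axis, or None where no point projects.
    """
    image: List[List[Color]] = [[(0, 0, 0)] * cam.width for _ in range(cam.height)]
    depth: List[List[Optional[float]]] = [[None] * cam.width for _ in range(cam.height)]
    cx, cy = cam.width / 2.0, cam.height / 2.0
    for (x, y, z), color in zip(points, colors):
        # Express the point in camera coordinates (translation only).
        xc = x - cam.position[0]
        yc = y - cam.position[1]
        zc = z - cam.position[2]
        if zc <= 0:
            continue  # point is behind the camera
        u = math.floor(cam.focal_px * xc / zc + cx)
        v = math.floor(cam.focal_px * yc / zc + cy)
        if 0 <= u < cam.width and 0 <= v < cam.height:
            # z-buffer test: keep only the nearest point for each pixel.
            if depth[v][u] is None or zc < depth[v][u]:
                depth[v][u] = zc
                image[v][u] = color
    return image, depth


def render_views(points: List[Point], colors: List[Color], cameras: List[Camera]):
    """Generate one (image, depth) pair per viewpoint required by the display method."""
    return [render_view(points, colors, cam) for cam in cameras]


# Two points on the same line of sight for the first camera: red in front, blue behind.
points = [(0.0, 0.0, 2.0), (0.0, 0.0, 5.0)]
colors = [(255, 0, 0), (0, 0, 255)]
cameras = [
    Camera(position=(0.0, 0.0, 0.0), focal_px=4.0, width=4, height=4),
    Camera(position=(1.0, 0.0, 0.0), focal_px=4.0, width=4, height=4),
]
views = render_views(points, colors, cameras)
```

In this sketch the target cameras are chosen independently of whatever cameras captured the subject, which mirrors the idea that the output viewpoints follow the display image generation method and not the capture setup. From the first camera the blue point is hidden behind the red one; from the second, shifted camera both points become visible at different pixels.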