One option is to group the photo and the caption. The downside of this method is that the text would be rasterized on export, so it might not look crisp. Another option is to add a line for the caption (as part of your body text) just below the inline image, something like this:
![Caption.JPG]()