Google did this for missing areas across the USA, had a largish setup of 360 degree cams on roof of cars - would just drive and record, thus google can offer maps down to the street level for some places - you can also Derive ground level with correct software.
ps: stitching is very easy of still pics, have no clue how to do that for video, would have to stitch frame by frame, would be a bit harder for compressed video as it is not frame by frame but is a composite of multiple frames to get a single frame - some only 'renew' what changed for very good compression and small file sizes - makes s/w much harder to stitch or edit in general. this is when all those 7,12,16 core cpus and graphic cards can be used as an oven !!
|