Is there any server-based software, preferably open source, that will generate videos from text, image, and music file inputs?
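ffmpeg can do this. You need a basic PHP script that takes the JPG images and calls ffmpeg to convert them into a video. See https://trac.ffmpeg.org/wiki/Create%20a%20video%20slideshow%20from%20images for details. This is the command. Note that it matches PNG files as written, so change the pattern to '*.jpg' for JPG images; -r 1 sets the rate to one image per second.
ffmpeg -r 1 -pattern_type glob -i '*.png' -c:v libx264 -pix_fmt yuv420p out.mp4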
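
For a server-side wrapper, here is a minimal sketch of the same idea. It is written in Python rather than PHP, which is my substitution, and the function names build_slideshow_command and make_slideshow are hypothetical. It assumes ffmpeg is installed and on the PATH, and it only rebuilds the command from the answer; it does not handle text or music inputs.

```python
import shlex
import subprocess


def build_slideshow_command(pattern="*.png", output="out.mp4", fps=1):
    """Return the ffmpeg argument list for a glob-based slideshow.

    The defaults reproduce the command from the answer. The glob is
    passed to ffmpeg unexpanded, so ffmpeg itself matches the files.
    """
    return [
        "ffmpeg",
        "-r", str(fps),              # images per second
        "-pattern_type", "glob",     # treat the input as a glob pattern
        "-i", pattern,
        "-c:v", "libx264",           # H.264 video encoder
        "-pix_fmt", "yuv420p",       # pixel format most players accept
        output,
    ]


def make_slideshow(pattern="*.png", output="out.mp4", fps=1):
    """Run ffmpeg (assumed to be on PATH); raises if ffmpeg fails."""
    subprocess.run(build_slideshow_command(pattern, output, fps), check=True)
    return output


if __name__ == "__main__":
    # Print the shell-quoted command instead of running it.
    print(shlex.join(build_slideshow_command()))
```

Passing the arguments as a list avoids shell quoting problems if the pattern or output name ever comes from user input. Calling make_slideshow("*.jpg", "slides.mp4") would run the same conversion on JPG files in the current directory.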