Compiler Toolchains for Deep Learning Workloads on Embedded Platforms