§ 6.2 Input Encodings

 supports the standard  mechanism for handling non-ASCII encodings of the input  sources: using the inputenc package. The  binding of inputenc loads the encoding definition (generally with extension def) directly from the  distribution (which are generally well-enough behaved to be easily processed). These encoding definitions make the upper 128 code points (of 8 bit) active and define  macros to handle them.

Using the commandline option --inputencoding=utf8 to latexml allows processing of sources encoded as utf8, without any special packages loaded. [future work will make  compatible with xetex]