Add a module that makes use of my own ROMIO for lustre



https://github.com/ufs-community/ufs-weather-model/issues/2347
For serial I/O, someone said NetCDF is faster than PnetCDF.
This may be because of MPI-I/O collective overhead, or
hash table size being too small.
* Check if communicator size is 1, then automatically switch to independent I/O
  mode.

----- done in PR 149

hash table size has been increased to 256

