Posting a Google breadcrumb:
This weekend I tried to use Cloudera Manager with Apache Whirr to set up a hadoop cluster. Somewhat frustratingly, Cloudera Manager does not work on OSX, and Whirr has a bug that makes it not work properly on EC2. So I settled on creating a Virtual Machine running Ubuntu on my laptop, which I created using Vagrant.
When trying to run the Cloudera installer, I got this error:
The encoding you selected (UTF8) and the encoding that the
selected locale uses (LATIN1) do not match. This would lead to
misbehavior in various character string processing functions.
Rerun initdb and either do not specify an encoding explicitly,
or choose a matching combination.
I couldn’t find a solution online, but what ultimately worked was:
$ sudo su – root
$ export LANG=“en_US.utf8"
$ export LC_ALL="en_US.utf8”
$ (still as root) ./cloudera-manager-installer.bin
The problem is that for some reason, the default locale/encoding on the vagrant VM prototype images is plain English, not UTF8. Hope this helps.