[Tex/LaTex] Cyrillic first names problem with natbib and bibtex

bibtexcyrillicnatbib

I'm using natbib and bibtex, typesetting a document in Russian (with pdflatex) in chicago. For an entry like this:

@book{belyj1913,
Address = {Москва},
Author = {Андрей Белый},
Publisher = {Наука},
Title = {Петербург},
Year = {1981}}

while running it I get:

! Package inputenc Error: Unicode char \u8:�. not set up for use with LaTeX.

See the inputenc package documentation for explanation.
Type  H <return>  for immediate help.
 ...                                              

l.47 Белый, �.
                (1981).

And the first name's initial does not appear in the pdf.

The problem disappears if I use plainnat, which puts full first names in. However, I love certain things about chicago, and I wonder if this initials extraction problem can be overcome.

I use babel for Cyrillic, here's an excerpt from the preamble:

\usepackage[T2A]{fontenc}
\usepackage[utf8]{inputenc}
\usepackage[english,russian]{babel}
...
\usepackage{natbib}
\bibpunct[:]{(}{)}{;}{a}{}{,}

Latin references work just fine. Other local features, like hyphenation, work too.

The discussion in Cyrillic bibliographic entry using Biblatex and Polyglossia (initials broken) is for biblatex and polyglossia, neither of which I'm using.

Best Answer

BibTeX is not really suited for UTF-8 encoding. In many cases it works, but it will definitely have problems with extraction of initials, because, for example, a Cyrillic ‘А’ is two bytes (0xD090) and only the first byte is selected.

You can work around this by using the internal representation of the Cyrillic letter:

@book{belyj1913,
  Address = {Москва},
  Author = {{\CYRA}ндрей Белый},
  Publisher = {Наука},
  Title = {Петербург},
  Year = {1981},
}

which will work independently whether only initials or full names are used.

However, sorting by author names is not really guaranteed to be correct. You should try biblatex and biber, which are UTF-8 compliant.