[Tex/LaTex] Why is biber so slow

biberbiblatex

I have a simple document, about 20 pages long with some mathematics and a couple of tikz diagrams. I use biblatex and biber (0.9.9 MacTeX 2011) to compile my references, of which there are currently 2, with maybe 10 citations.

When using the bibtex backend the bibliography stage of my compilation takes under a second. Biber takes over 5 seconds to process exactly the same file.

I use TeXShop and with biber the console window appears but none of biber's command line output appears for at least 4 seconds.

Is there a problem with my setup or is biber slow by design?

EDIT: yes, it is slow every time. I've done some digging into the folders where biber unpacks its perl dependencies and I think TeXShop might be unpacking it every time. Perhaps something deletes the unpacked binary after each use.

The way I've made TeXShop use biber is to change the bibTeX engine field in the preferences to `biber'.

Best Answer

It is slower than bibtex which is in C, even if you take into consideration the first run unpacking. Bear in mind that biber does a lot more than bibtex too. They are hardly comparable in functionality at all. Your tikz and maths should make no difference to biber. If your cache is getting deleted every time you run, this will make a huge difference. Easy to check this - delete the cache and run. Is the second biber run any faster?

The main overhead is sorting. It is a complex business, dealing with much more than bibtex - Unicode 7.0, direction per-field, case per field ... Next overhead is uniqueness processing. Again, complex. Bibtex probably does about 20% of what biber does. See the biber PDF manual to get a sense of its share of the biblatex work.

As of version 2.5 (currently in DEV), I have done some profiling with NYTProf. The majority of bibers time is spent inside the Unicode::Collate module (written in C), as one would expect as sorting is a main focus and it's expensive to do tailored UCA sorting (which bibtex doesn't even come close to doing). After some examining of the call stacks, I've done some loop tidying for sorting calls and now biber 2.5 is about four times as fast as 2.4 and probably all earlier versions.

As mentioned in the doc, for performance testing, I use a 2150 entry, 15,000 line .bib file which references a 630 entry macro file with a resulting 160 or so page bibliography. In biber 2.4 this takes about 2 minutes to process. In the current 2.5 development version it takes about 28 seconds. This is almost the same now as when using the --fastsort option which doesn't use Unicode collation (so I may drop --fastsort since it is functionally far less useful and if there is no performance benefit, there is no longer any point in it).

Related Solutions

[Tex/LaTex] Using Biber with latexmk

exists $generated_log{"$bbl_base.bcf"} does a test if the list of files %generated_log includes an entry for "$bbl_base.bcf". From my point of view such an entry is missing in the list %generated_log, the reason why the call fails. If you test it with -e which does in general the same, it works in fact of the now missing check of %generated_log. You should report it to the author. It could also be possible that some of the Perl functions have not the same behaviour as with Linux.

[Tex/LaTex] Problem getting biber/biblatex/XeLaTeX/TeXworks working together

You might need to change how Texworks calls Biber. I was able to get the Korean text to appear without any issue by letting Biber know that the input and output were unicode.

To modify the arugments Texworks sends to Biber open up the typesetting preferences: Edit->Preferences->Typesetting. In the Processing Tools box, select Biber then click on edit. In the window that pops up, you can add arguments. I have -U -u and $basename, all as separate entries. The "U/u" entries tell Biber that the input and the ouput are utf8.

If you routinely use non-utf8 .bib files then you can achieve the same functionality by calling biblatex in the following manner:

\usepackage[backend=biber,texencoding=utf8,bibencoding=utf8]{biblatex}

The default behavior without those arguments is that Biblatex is supposed to detect automatically the encoding of the .tex document and assume that the .bib file is the same. It might be the case, as it was for me, that biblatex needs a little reminder that it should be using utf8.

Using this set up, the I had no problems compiling the MWE you provided above. I did run into the issue of no bibliography or citations appearing when using style=apa in biblatex. I am not familiar with this style so I cannot troubleshoot what is happening here. But I was able to get Korean to appear when I set style=mla. I realize that this might not be your desired style or the style you are required to use, but it at least demonstrates that Texworks using Biblatex and Biber can process the Korean text you ask it to compile.

Best Answer

Related Solutions

[Tex/LaTex] Using Biber with latexmk

[Tex/LaTex] Problem getting biber/biblatex/XeLaTeX/TeXworks working together

Related Question