There are roughly two ways to patch a command: via \scantokens, and via expansion+redefinition; there's a (not so) brief explanation of both at the end of this answer. When ltcmdhooks can detect the type of a command, so that it knows exactly the <parameter text> of the command, it patches by expansion+redefinition, which imposes no restriction on the catcode settings in force when the macro was defined. \appendix takes no arguments, so it can be treated as a token list: expanded, then redefined with the added material.
For example, here's a simple sketch of how it works:
\def\appendix{%
\typeout{This starts the appendix.}}
% \append<cmd>{<material>} expands <cmd> once, then redefines it with
% <material> appended to its replacement text:
\def\append#1{%
\expandafter\appendaux\expandafter{#1}#1}
% #1 = old replacement text, #2 = the command, #3 = the added material
\def\appendaux#1#2#3{%
\def#2{#1#3}}
\append\appendix{\typeout{I added this.}}
\appendix
However, what I did not anticipate when I wrote that code is the case where the original definition of \appendix contains ## (try this definition in the code above):
\def\appendix{%
\typeout{This starts the appendix. ##BOOM!}}
When \appendix is defined like that, TeX's definition scanner sees the two consecutive parameter characters ## and replaces them by a single parameter token # in the definition of \appendix; so far so good. However, when you expand the command, TeX also returns that single #, so when you then try to redefine the command you have:
\def\appendix{%
\typeout{This starts the appendix. #BOOM!}%
\typeout{I added this.}}
which contains an illegal parameter (#B), and the definition errors.
I have changed ltcmdhooks to handle this case (there's a brief explanation below), but meanwhile you can use \ActivateGenericHook (or \ProvideHook in LaTeX 2021-06-01) to tell ltcmdhooks that you have already patched the command, so it won't try patching; you then do the patching manually using etoolbox:
\documentclass{book}
\usepackage{cleveref}
\usepackage{etoolbox}
\IfFormatAtLeastTF{2021-11-15}%
{\ActivateGenericHook}% LaTeX >= 2021-11-15
{\ProvideHook}% LaTeX = 2021-06-01
{cmd/appendix/before}
\pretocmd\appendix
{\UseHook{cmd/appendix/before}}
{}{\FAILED}
\AddToHook{cmd/appendix/before}{\label{appendix}}
\begin{document}
Hello world!
\appendix
\end{document}
Why the above works
The interface for ltcmdhooks in \AddToHook is supposed to work as follows:
- If an end user writes \AddToHook{cmd/name/before}{code}, and the hook cmd/name/before doesn't exist yet (which implies that the command \name doesn't have that hook "installed"), then the code tries to patch that hook into the command.
- If the end user writes \AddToHook{cmd/name/before}{code}, and the hook cmd/name/before already exists, this (probably) means that the command \name already has that hook, so the code is just added to the hook and the command is left alone.
This means that a package author may want to fine-tune the position of the cmd/name/before hook (for example, \def\name{<some initialization>\UseHook{cmd/name/before}<definition>}); in that case we don't want ltcmdhooks patching the command again (it would be wrong to add the same hook twice), so we tell ltcmdhooks that the hook already exists by saying \ActivateGenericHook{cmd/name/before}, and patching is no longer attempted.
This works for your case because you manually add the hook to the command, and then tell ltcmdhooks that patching is no longer needed. See section 3, Package Author Interface, of the ltcmdhooks documentation.
So in essence you, as the package author, are appropriating the \appendix command by adding the hook yourself (exactly where ltcmdhooks would add it), and then telling ltcmdhooks not to patch it by using \ActivateGenericHook.
If instead of \appendix you were adding hooks to \UniqueCommandFromMyPackage, then you could use \NewHook instead of \ActivateGenericHook (the effect would be identical), because there would be no possibility of a name conflict.
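As a sketch of that fine-tuning scenario (the command and hook names here are hypothetical, and the \typeout lines are stand-ins for real code), a package author could write:
% Install the generic hook by hand at a chosen spot, then declare it so
% ltcmdhooks never tries to patch \mycmd itself:
\ActivateGenericHook{cmd/mycmd/before}% (\ProvideHook in LaTeX 2021-06-01)
\def\mycmd{%
\typeout{some initialization}% runs before the hook
\UseHook{cmd/mycmd/before}%
\typeout{actual definition}%
}
After this, an end user's \AddToHook{cmd/mycmd/before}{code} just fills the existing hook instead of triggering a patch.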
How LaTeX2ε handles this case now
The problem: it turns out that in the described case we're at a dead end. When you write a definition like
\def\foo#1{#1##X}
TeX stores its <replacement text> as a token list containing:
out_param 1, par_token #, letter X
(out_param 1 is the #1 to be replaced by the actual parameter when the macro is expanded, par_token # is a catcode-6 #, and letter X is a catcode-11 X).
Then, when you expand \foo with the argument #1 (par_token #, character 1), TeX replaces out_param 1 and you have:
par_token #, character 1, par_token #, letter X
which is equivalent to typing #1#X. If you plug that back into a new definition of \foo you'll have:
\def\foo#1{#1#X}
which is obviously wrong (hence the Illegal parameter number error). And at this point you have no way to tell what was an actual parameter when the macro was defined and what was a single parameter token.
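You can see this dead end with a small experiment (using Y as the argument, since a literal # can't be typed at top level; \bar is just a scratch name):
\def\foo#1{#1##X}
% Expand \foo once and try to use the result as a new definition.
% This fails with "! Illegal parameter number in definition of \bar."
% because \foo{Y} expands to Y#X, where # is a lone parameter token:
\expandafter\def\expandafter\bar\expandafter{\foo{Y}}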
Half solution: there is one very simple case that can be easily detected and solved (which coincidentally is the one in your question): a macro without parameters. In this case the macro takes no arguments, so any loose ## in its definition cannot possibly be confused with a parameter, and we can treat such macros as token lists (in the expl3 sense), do something akin to \tl_put_right:Nn, and the problem is solved.
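In expl3 terms, that parameterless case looks roughly like this (a sketch of the idea, not the actual kernel code):
\ExplSyntaxOn
% A parameterless macro is, for all practical purposes, a token list,
% so appending material preserves any ## in its body:
\cs_set:Npn \appendix { \typeout { This~starts~the~appendix.~##BOOM! } }
\tl_put_right:Nn \appendix { \typeout { I~added~this. } }
\ExplSyntaxOff
The tl functions append without ever rescanning the parameter token, so no Illegal parameter number error can occur.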
Another relatively simple case is when the macro has no ## in its definition. In that case we don't have to worry about confusing parameters, so we treat the macro normally (this was the case implemented initially). LaTeX uses a rather simple loop (\__hook_if_has_hash:nTF) to check whether a macro has a parameter token in its definition: it looks at every token in the definition and compares it with #.
The other half: when the macro falls into the general case of having both parameters and parameter tokens in its definition (like \foo above), we have to manually re-double every parameter token in the definition so that it can be re-made. To do that, instead of expanding \foo with #1, LaTeX expands it with \c_@@_hash_tl 1, so \foo{\c_@@_hash_tl 1} leads to a definition like:
\foo#1{\c_@@_hash_tl 1#X}
then we loop through the replacement text of the macro (inside the braces), double every # into ##, and replace every \c_@@_hash_tl by a single #, which then gives:
\foo#1{#1##X}
and then we can do the definition normally (phew!).
Patching with \scantokens
(wordier description here)
Suppose a macro is defined with
\long\def\mycmd[#1]#2{\typeout{#1//#2}}
To append some code to it via \scantokens, you first do \meaning\mycmd to get a string like:
\long macro:[#1]#2->\typeout {#1//#2}
(with the usual \detokenize catcodes: everything catcode 12 except spaces, which are catcode 10), then you use a delimited macro to separate the <prefixes>, the <parameter text>, and the <replacement text>, roughly like this:
\def\split#1{\expandafter\splitaux\meaning#1\relax}
\expanded{%
\noexpand\def\noexpand\splitaux#1\detokenize{macro:}#2->#3\relax}{%
\def\prefixes{#1}%
\def\parameter{#2}%
\def\replacement{#3}}
(I'm using \def\prefixes{#1}, etc., for the sake of understandability, but in reality you would inject everything expandably instead; see the definition of \__kernel_prefix_arg_replacement:wN in expl3-code.tex, and \etb@patchcmd in etoolbox.sty if you're feeling brave.)
At this point you have each part of the definition as a separate string. Now you can append or prepend some code to \replacement (or replace some part of it, as is done in \patchcmd), or in rarer cases change \prefixes or \parameter. To reconstruct the definition you need:
<prefixes>\def\mycmd<parameter text>{<replacement text>}
but the three parts you have are still catcode-12 tokens, which are no good. Here comes the \scantokens part: you rescan those strings back into "normal" tokens:
\expanded{%
\noexpand\scantokens{%
% <prefixes>\def \mycmd<parameter text>{<replacement text>}
\prefixes \def\noexpand\mycmd\parameter {\replacement <added material>}%
}%
}
which, after \expanded does its job, becomes:
\scantokens{%
\long\def\mycmd[#1]#2{\typeout {#1//#2}<added material>}%
}%
then \scantokens does its thing and turns everything back into tokens using the current catcode settings, after which the definition is carried out normally.
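This detokenize-and-rescan strategy is essentially what etoolbox's \patchcmd and \apptocmd implement for you, so in practice the append could be written as (a sketch; the \typeout messages are placeholders):
\usepackage{etoolbox}
\long\def\mycmd[#1]#2{\typeout{#1//#2}}
% Append material via etoolbox's \meaning/\scantokens machinery;
% the last two arguments run on success/failure of the patch:
\apptocmd\mycmd{\typeout{added material}}
{\typeout{patched OK}}{\typeout{patching failed}}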
The advantage of this method is that you can do virtually any manipulation on any part of the definition.
The disadvantages are a few:
- You need to know which catcodes were in force when the definition was first made (when patching you usually need to verify that a simple round of \meaning–\scantokens doesn't change the meaning of the macro), otherwise you can't patch safely;
- If the macro was created with some combination of \edef and \detokenize to forcibly make some catcode-12 tokens, you will probably not be able to patch it (for example, \splitaux as defined above in this answer can never be patched with \patchcmd because it contains letters (for example, m) of both catcodes 11 and 12);
- If the <parameter text> of the macro contains the characters ->, you won't be able to patch the macro.
Patching with expansion+redefinition
This method is much simpler, but requires prior knowledge of how the macro was defined. That is possible in a few cases, namely when you know exactly what the <parameter text> of the macro is. The cases known by the kernel are when the macro was defined with \DeclareRobustCommand, or with ltcmd (\NewDocumentCommand or \NewExpandableDocumentCommand), or with \newcommand with an optional argument, or when the macro takes no arguments.
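For those known cases no manual patching is needed at all; for example, with an ltcmd-defined command (the name \mycmd here is hypothetical) you can hook directly:
\NewDocumentCommand\mycmd{O{default}m}{\typeout{#1//#2}}
% ltcmdhooks knows the <parameter text> of ltcmd commands,
% so this patches safely, with no catcode restrictions:
\AddToHook{cmd/mycmd/before}{\typeout{before \string\mycmd}}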
Suppose the same macro as before, but defined with:
\newcommand\mycmd[2][default]{\typeout{#1//#2}}
(it will have an internal macro called \\mycmd, but for the sake of simplicity let's call it \mycmd as well); then we know for sure that its <parameter text> is [#1]#2. Knowing what arguments the macro expects, we can feed it #1, #2, ... as arguments, so for \mycmd we would do:
\mycmd[#1]{#2}
which would then expand to the <replacement text> of the macro, with the first parameter replaced by #1 (the parameter token #, catcode 6, followed by the character 1, catcode 12). The patching scheme would be something like:
\expanded{%
\def\noexpand\mycmd[#1]#2{%
\unexpanded\expandafter{\mycmd[#1]{#2}<added material>}%
}%
}
then after the \expanded is done you are left with:
\def\mycmd[#1]#2{\typeout{#1//#2}<added material>}
which is exactly what you had with the \scantokens approach, except that you never turned the tokens into a string, so catcodes don't matter at all here.
The advantages of this method are roughly the disadvantages of the \scantokens method:
- catcodes don't matter at all;
- you can patch complicated macros (including the \splitaux macro from before) using this method, given that you know exactly what its <parameter text> is;
- the <parameter text> of the macro may contain any token your heart desires (as long as you know what token it is); and
- this method doesn't need a sanity check to ensure that the macro can be patched correctly.
The disadvantage is the requirement for the method to work: you need to know exactly what the <parameter text> is.
Best Answer
Here are 5 different options:
- e-type expansion will fail if postheadhook contains anything fragile.
- o-type expansion doesn't work because it expands exactly once, and \exp_args:NnV takes way more expansions than that to work.
- f-type expansion is inappropriate 99% of the time, but it seems fine here. I'd slightly prefer it, just in case \AddToHookNext were to blow up when expanded, but it (and most other LaTeX commands) are protected, so this doesn't really make a difference in this case.
(What would be best would be something like \hook_gput_code:nne … \hook_gput_next_code:ne … \exp_not:V \l_mbert_thm_postheadhook_tl, but that throws an error for some unknown reason.)