chaley, good point regarding individuals' name choices varying. I've been using ISFDB, Wikipedia, and author websites as my baseline verification of how authors like their names to be split, and still I probably get some wrong. The "van LastLastName" and "de la LastLastName" etc cases are rare enough I usually remember which way I've split them. Such as "Lustbader, Eric Van" and "van Vogt, A E". Bottom line is I don't care if I split it wrong but in each case I want to split it consistently the same way, to avoid having duplicate books under "Lustbader, Eric Van" and "Van Lustbader, Eric." After this discussion, which seems ridiculous, I'm tempted to ignore the way the authors like it and just split them the way the simple regex does and not worry about it.
Last edited by unboggling; 07-27-2011 at 12:57 PM.
|