datatracker/ietf
Henrik Levkowetz 0b8bcfa81d Fix a series of issues found during testing. This is the patch provide
to Yaco on 2011-03-19, and committed on branch/yaco/idsubmit as [2896].

   * Extraction of Title which don't have the draft name on a separate
     page fails.  See for instance this example:
     http://www.ietf.org/staging/draft-ma-cdni-publisher-use-cases-00.txt
     The regex should maybe be updated to permit but not require a newline
     before the draft filename:
     '(?:\n\s*\n\s*)((.+\n){1,2}(.+\n?))(\s+<?draft-\S+\s*\n)\s*\n'

   * If there are blank lines before the start of the author list on the
     first page, the author extraction will fail.  This sometimes happens
     when there's junk at the start of a draft, see for instance
     http://www.ietf.org/id/draft-ietf-mpls-tp-process-00.txt .

   * Sometimes the Authors' Addresses section lists authors with the same
     workplace address on the same line: "Sam Spade and Joe Smith".  This
     needs a fix in the author extraction code.

   * Sometimes the order of first name, surname is different on the first
     page and in the author list, and sometimes the surname is uppercase
     in one place, but not in the other.  This also needs a fix in the
     author extraction code.

   * The header stripping code had a bug, where multiple blank lines could
     be replaced by a single blank line in the stripped text, which could
     mess up title extraction.

   * Title space normalization should be done also for titles from the
     'unusual title format' code branch of the title extraction code.

   * Company names on the first page are sometimes rendered with different
     case than in the Authors' Addresses section.

   * Some drafts list the draft filename _before_ the title, rather than
     after the title.  Permit this too. Covered in the patch.

   * Spanish names can be shown as either
	<given_name> <fathers_first_surname> <mothers_first_surname>
     or less formally as
	<given_name> <fathers_first_surname>
     If the first form is used in the Authors' Addresses section, but the
     second form (with the given name possibly abbreviated to its first
     letter) the author extraction will fail.

   * Drafts containing tabs will be caught by idnits during I-D submission,
     but in case the drafts.py module is used independently from idnits,
     convert tabs to spaces in order for the author extraction and other
     methods to work as expected.  Example: recently submitted draft
     draft-bergeron-payload-rtpfec-rs-00.txt.

   * Found a draft with a previously unhandled header/footer format:
     draft-fang-mpls-tp-oam-toolset-01.txt.  Tweak needed for header/footer
      stripping.
 - Legacy-Id: 2919
Note: SVN reference [2896] has been migrated to Git commit 5a34b70e52
2011-03-24 13:05:48 +00:00
..
announcements Corrected send_scheduled_announcement fixes. 2010-11-09 08:55:23 +00:00
bin Merged from log:branch/iesg-tracker@2571: IOLA's port of cron-scripts. 2010-10-26 14:49:12 +00:00
contrib Merged from log:branch/2.00@2363: Current release branch head to trunk. 2010-07-21 12:48:05 +00:00
cookies Merged [2676] from kivinen@iki.fi: 2010-11-11 02:31:38 +00:00
database-notes Merged from log:branch/2.00@2363: Current release branch head to trunk. 2010-07-21 12:48:05 +00:00
idindex Catch bad or missing area when generating the all_id2 file. 2010-08-19 13:52:28 +00:00
idrfc Provide a full URL to the last call email, rather than just the local url path. 2011-03-24 12:37:45 +00:00
idtracker Tweak the admin interface for WGs and Chairs, adding some useful links to the tabular displays. 2011-03-24 10:11:55 +00:00
iesg As we're restoring the ability to re-schedule from the IESG agenda document list, we also restore the accompanying test. However, also retain the modified test which tested re-scheduling from the /edit/info/ page (under another test class name). 2011-01-28 12:15:33 +00:00
ietfauth Show user levels as more than numbers in the IESG login table. Add utility methods for use in the admin interface. 2011-03-24 10:07:46 +00:00
ipr Only try to open a legacy url if it leads to a file; not if it leads to a directory. 2011-01-28 09:52:14 +00:00
liaisons Fix typo. 2011-03-24 12:36:31 +00:00
mailinglists Merged from log:branch/2.00@2363: Current release branch head to trunk. 2010-07-21 12:48:05 +00:00
meeting Merged [2679] from adam@nostrum.com: 2010-11-11 02:46:33 +00:00
proceedings Tweak model names and admin display to make it easier to manage things (and understand the meaning/content of the different tables). 2011-03-24 10:09:55 +00:00
redirects Merged from log:branch/2.00@2363: Current release branch head to trunk. 2010-07-21 12:48:05 +00:00
templates Link to the State help page from the 'State:' label. 2011-02-17 18:19:00 +00:00
utils Fix a series of issues found during testing. This is the patch provide 2011-03-24 13:05:48 +00:00
wginfo Merged [2614] from peter.musgrave@magorcorp.com: Add column IPR to doc table for WG docs and by AD docs. Fix column in doc search results so IPR is shown correctly there too. Fixes issue #497. 2010-11-06 08:51:32 +00:00
.gitignore Merged from log:branch/2.00@2363: Current release branch head to trunk. 2010-07-21 12:48:05 +00:00
__init__.py Set version info and settings back to development mode 2011-02-02 17:35:44 +00:00
context_processors.py Merged from log:branch/2.00@2363: Current release branch head to trunk. 2010-07-21 12:48:05 +00:00
manage.py Merged from log:branch/2.00@2363: Current release branch head to trunk. 2010-07-21 12:48:05 +00:00
middleware.py Merged from log:branch/2.00@2363: Current release branch head to trunk. 2010-07-21 12:48:05 +00:00
settings.py Make the document announcement default Cc: somewhat configurable. Remove iab@iab.org (on request). 2011-02-21 12:45:09 +00:00
settings_sqlitetest.py Merged from log:branch/2.00@2363: Current release branch head to trunk. 2010-07-21 12:48:05 +00:00
urls.py Disable the bulk delete action from the admin interface. 2011-03-24 10:12:51 +00:00