Changeset 4378
- Timestamp: 01/10/08 23:34:32 (10 months ago)
- Location: lang/perl/misc/wikipejago
- Files: 2 modified
  - ChangeLog (modified) (1 diff)
  - ext-wpj-person.pl (modified) (2 diffs)
lang/perl/misc/wikipejago/ChangeLog
    r3802 r4378
    + 2008-01-10  yto <yto at nais dot to>
    +
    +   * ext-wpj-person.pl: Japanese person name => All person name.
    +
      2007-12-30  yto <yto at nais dot to>
lang/perl/misc/wikipejago/ext-wpj-person.pl
    r3802 r4378
      #!/usr/bin/perl
    - # Extract what look like Japanese people's names from Wikipedia (http://ja.wikipedia.org/)
    + # Extract what look like people's names from Wikipedia (http://ja.wikipedia.org/)
      # [Step.1]
      # wget http://download.wikimedia.org/jawiki/latest/jawiki-latest-pages-articles.xml.bz2
      …
      my $title = $1;
      next if $title =~ m{^([^<]+:|\d{4})};
    - if ($page =~ m{(Category:\d+年生|\| Born)}
    - and $page =~ m{Category:日本}
    - and $page !~ m{Category:日本生産}
    + if ($page =~ m{(Category:\d+年生)}
    + and $page !~ m{Category:[^\|]*(犬|馬)[\|\]]}
      ) {
      $title =~ s/\s+\(.+?\)\s*$//;