|
Revision 552, 0.7 kB
(checked in by yusukebe, 11 months ago)
|
|
motteke ERO YAML
|
| Line | |
|---|
| 1 | # http://duga.jp/ |
|---|
| 2 | # act is not set yet... |
|---|
| 3 | author: yusukebe |
|---|
| 4 | custom_feed_handle: http://duga\.jp/.*? |
|---|
| 5 | custom_feed_follow_link: http://duga\.jp/*.?/(ppv|month)/.*?\d{4}/ |
|---|
| 6 | handle: http://duga\.jp/*.?/(ppv|month)/.*?\d{4}/ |
|---|
| 7 | extract: <meta name="description" content="(.*?)".*?<h1>(.*?)</h1>.*?<div class="product-img">.*?<img src="(.*?)".*?レーベル.*?<a href="/keyword/.*?">(.*?)</a>.*?公開開始日.*?<td style="padding-top:2px;"><p>(.*?)</p>.*?カテゴリ.*?<td style="padding-top:2px;"><p>(.*?)</p> |
|---|
| 8 | extract_capture: body title img label date category |
|---|
| 9 | extract_date_format: %y年%m月%d日 |
|---|
| 10 | extract_after_hook: | |
|---|
| 11 | $data->{category} =~ s/<a href=.*?>(.*?)<\/a>/$1/g; |
|---|
| 12 | $data->{category} =~ s/>/,/g; |
|---|
| 13 | $data->{tags} = $data->{category} . "," . $data->{label} |
|---|