NEW: www/py-html5-parser

Previous Topic Next Topic
 
classic Classic list List threaded Threaded
9 messages Options
Reply | Threaded
Open this post in threaded view
|

NEW: www/py-html5-parser

Rafael Sadowski
Please find attached a new dependency for an possible calibre update.

Information for inst:py3-html5-parser-0.4.9

Comment:
fast C based HTML 5 parsing for python

Description:
A fast implementation of the HTML 5 parsing spec for Python. Parsing is done in
C using a variant of the gumbo parser. The gumbo parse tree is then transformed
into an lxml tree, also in C, yielding parse times that can be a thirtieth of
the html5lib parse times.

Maintainer: The OpenBSD ports mailing-list <[hidden email]>

WWW: https://pypi.python.org/pypi/html5-parser



py3-html5-parser-0.4.9.tar.gz (1K) Download Attachment
Reply | Threaded
Open this post in threaded view
|

Re: NEW: www/py-html5-parser

Daniel Dickman


On Mon, 15 Feb 2021, Rafael Sadowski wrote:

> Please find attached a new dependency for an possible calibre update.
>

I think it needs dependencies on at least:
- py-lxml
- py-chardet
- py-beautifulsoup4

maybe more things too, but that's as far as I got.

Reply | Threaded
Open this post in threaded view
|

Re: NEW: www/py-html5-parser

Stuart Henderson
On 2021/02/15 14:02, Daniel Dickman wrote:

>
>
> On Mon, 15 Feb 2021, Rafael Sadowski wrote:
>
> > Please find attached a new dependency for an possible calibre update.
> >
>
> I think it needs dependencies on at least:
> - py-lxml
> - py-chardet
> - py-beautifulsoup4

beautifulsoup4 is optional. I wonder why portgen doesn't notice lxml/chardet.

> maybe more things too, but that's as far as I got.

there are forced -O3 in the C compiles.

Reply | Threaded
Open this post in threaded view
|

Re: NEW: www/py-html5-parser

Rafael Sadowski
On Tue Feb 16, 2021 at 04:42:07PM +0000, Stuart Henderson wrote:

> On 2021/02/15 14:02, Daniel Dickman wrote:
> >
> >
> > On Mon, 15 Feb 2021, Rafael Sadowski wrote:
> >
> > > Please find attached a new dependency for an possible calibre update.
> > >
> >
> > I think it needs dependencies on at least:
> > - py-lxml
> > - py-chardet
> > - py-beautifulsoup4
>
> beautifulsoup4 is optional. I wonder why portgen doesn't notice lxml/chardet.
>
> > maybe more things too, but that's as far as I got.
>
> there are forced -O3 in the C compiles.
>
Thanks for your feedback. New attempt attached with the following
changes:

- Remove 03 opts
- Added HOMEPAGE
- Added the following depends:

BUILD_DEPENDS = textproc/py-chardet${MODPY_FLAVOR} \
                        textproc/py-lxml${MODPY_FLAVOR} \
                        www/py-beautifulsoup4${MODPY_FLAVOR}

RUN_DEPENDS = www/py-beautifulsoup4${MODPY_FLAVOR}

py3-html5-parser-0.4.9p0.tar.gz (1K) Download Attachment
Reply | Threaded
Open this post in threaded view
|

Re: NEW: www/py-html5-parser

Stuart Henderson
On 2021/02/16 21:50, Rafael Sadowski wrote:

> On Tue Feb 16, 2021 at 04:42:07PM +0000, Stuart Henderson wrote:
> > On 2021/02/15 14:02, Daniel Dickman wrote:
> > >
> > >
> > > On Mon, 15 Feb 2021, Rafael Sadowski wrote:
> > >
> > > > Please find attached a new dependency for an possible calibre update.
> > > >
> > >
> > > I think it needs dependencies on at least:
> > > - py-lxml
> > > - py-chardet
> > > - py-beautifulsoup4
> >
> > beautifulsoup4 is optional. I wonder why portgen doesn't notice lxml/chardet.
> >
> > > maybe more things too, but that's as far as I got.
> >
> > there are forced -O3 in the C compiles.
> >
>
> Thanks for your feedback. New attempt attached with the following
> changes:
>
> - Remove 03 opts
> - Added HOMEPAGE
> - Added the following depends:
>
> BUILD_DEPENDS = textproc/py-chardet${MODPY_FLAVOR} \
> textproc/py-lxml${MODPY_FLAVOR} \
> www/py-beautifulsoup4${MODPY_FLAVOR}
>
> RUN_DEPENDS = www/py-beautifulsoup4${MODPY_FLAVOR}

beautifulsoup4 is an optional run dependency.
py-lxml and py-chardet are required run depencies.
None of them are needed at build time.

Reply | Threaded
Open this post in threaded view
|

Re: NEW: www/py-html5-parser

Daniel Dickman


> On Feb 16, 2021, at 4:57 PM, Stuart Henderson <[hidden email]> wrote:
>
> On 2021/02/16 21:50, Rafael Sadowski wrote:
>>> On Tue Feb 16, 2021 at 04:42:07PM +0000, Stuart Henderson wrote:
>>> On 2021/02/15 14:02, Daniel Dickman wrote:
>>>>
>>>>
>>>> On Mon, 15 Feb 2021, Rafael Sadowski wrote:
>>>>
>>>>> Please find attached a new dependency for an possible calibre update.
>>>>>
>>>>
>>>> I think it needs dependencies on at least:
>>>> - py-lxml
>>>> - py-chardet
>>>> - py-beautifulsoup4
>>>
>>> beautifulsoup4 is optional. I wonder why portgen doesn't notice lxml/chardet.
>>>
>>>> maybe more things too, but that's as far as I got.
>>>
>>> there are forced -O3 in the C compiles.
>>>
>>
>> Thanks for your feedback. New attempt attached with the following
>> changes:
>>
>> - Remove 03 opts
>> - Added HOMEPAGE
>> - Added the following depends:
>>
>> BUILD_DEPENDS =        textproc/py-chardet${MODPY_FLAVOR} \
>>            textproc/py-lxml${MODPY_FLAVOR} \
>>            www/py-beautifulsoup4${MODPY_FLAVOR}
>>
>> RUN_DEPENDS =        www/py-beautifulsoup4${MODPY_FLAVOR}
>
> beautifulsoup4 is an optional run dependency.
> py-lxml and py-chardet are required run depencies.
> None of them are needed at build time.

bs4 should be a TDEP.


>

Reply | Threaded
Open this post in threaded view
|

Re: NEW: www/py-html5-parser

Andrew Hewus Fresh
In reply to this post by Stuart Henderson
On Tue, Feb 16, 2021 at 04:42:07PM +0000, Stuart Henderson wrote:

> On 2021/02/15 14:02, Daniel Dickman wrote:
> >
> >
> > On Mon, 15 Feb 2021, Rafael Sadowski wrote:
> >
> > > Please find attached a new dependency for an possible calibre update.
> > >
> >
> > I think it needs dependencies on at least:
> > - py-lxml
> > - py-chardet
> > - py-beautifulsoup4
>
> beautifulsoup4 is optional. I wonder why portgen doesn't notice lxml/chardet.

Because it's not listed under "requires_dist" on PyPI
https://pypi.org/pypi/html5-parser/json
(requires_dist is null)

None of those depends are listed anywhere in that JSON file afaict.

Reply | Threaded
Open this post in threaded view
|

Re: NEW: www/py-html5-parser

Rafael Sadowski
In reply to this post by Daniel Dickman
On Tue Feb 16, 2021 at 05:13:15PM -0500, Daniel Dickman wrote:

>
>
> > On Feb 16, 2021, at 4:57 PM, Stuart Henderson <[hidden email]> wrote:
> >
> > On 2021/02/16 21:50, Rafael Sadowski wrote:
> >>> On Tue Feb 16, 2021 at 04:42:07PM +0000, Stuart Henderson wrote:
> >>> On 2021/02/15 14:02, Daniel Dickman wrote:
> >>>>
> >>>>
> >>>> On Mon, 15 Feb 2021, Rafael Sadowski wrote:
> >>>>
> >>>>> Please find attached a new dependency for an possible calibre update.
> >>>>>
> >>>>
> >>>> I think it needs dependencies on at least:
> >>>> - py-lxml
> >>>> - py-chardet
> >>>> - py-beautifulsoup4
> >>>
> >>> beautifulsoup4 is optional. I wonder why portgen doesn't notice lxml/chardet.
> >>>
> >>>> maybe more things too, but that's as far as I got.
> >>>
> >>> there are forced -O3 in the C compiles.
> >>>
> >>
> >> Thanks for your feedback. New attempt attached with the following
> >> changes:
> >>
> >> - Remove 03 opts
> >> - Added HOMEPAGE
> >> - Added the following depends:
> >>
> >> BUILD_DEPENDS =        textproc/py-chardet${MODPY_FLAVOR} \
> >>            textproc/py-lxml${MODPY_FLAVOR} \
> >>            www/py-beautifulsoup4${MODPY_FLAVOR}
> >>
> >> RUN_DEPENDS =        www/py-beautifulsoup4${MODPY_FLAVOR}
> >
> > beautifulsoup4 is an optional run dependency.
> > py-lxml and py-chardet are required run depencies.
> > None of them are needed at build time.
>
> bs4 should be a TDEP.
>
>
> >

Thanks. I'm not familiar with extras_require,install_requires and the
whole python setup. OK with this dependencies?

LIB_DEPENDS = textproc/libxml

RUN_DEPENDS = textproc/py-chardet${MODPY_FLAVOR} \
                        textproc/py-lxml${MODPY_FLAVOR}

TEST_DEPENDS = www/py-beautifulsoup4${MODPY_FLAVOR}

Reply | Threaded
Open this post in threaded view
|

Re: NEW: www/py-html5-parser

Rafael Sadowski
On Wed Feb 17, 2021 at 06:35:53AM +0100, Rafael Sadowski wrote:

> On Tue Feb 16, 2021 at 05:13:15PM -0500, Daniel Dickman wrote:
> >
> >
> > > On Feb 16, 2021, at 4:57 PM, Stuart Henderson <[hidden email]> wrote:
> > >
> > > On 2021/02/16 21:50, Rafael Sadowski wrote:
> > >>> On Tue Feb 16, 2021 at 04:42:07PM +0000, Stuart Henderson wrote:
> > >>> On 2021/02/15 14:02, Daniel Dickman wrote:
> > >>>>
> > >>>>
> > >>>> On Mon, 15 Feb 2021, Rafael Sadowski wrote:
> > >>>>
> > >>>>> Please find attached a new dependency for an possible calibre update.
> > >>>>>
> > >>>>
> > >>>> I think it needs dependencies on at least:
> > >>>> - py-lxml
> > >>>> - py-chardet
> > >>>> - py-beautifulsoup4
> > >>>
> > >>> beautifulsoup4 is optional. I wonder why portgen doesn't notice lxml/chardet.
> > >>>
> > >>>> maybe more things too, but that's as far as I got.
> > >>>
> > >>> there are forced -O3 in the C compiles.
> > >>>
> > >>
> > >> Thanks for your feedback. New attempt attached with the following
> > >> changes:
> > >>
> > >> - Remove 03 opts
> > >> - Added HOMEPAGE
> > >> - Added the following depends:
> > >>
> > >> BUILD_DEPENDS =        textproc/py-chardet${MODPY_FLAVOR} \
> > >>            textproc/py-lxml${MODPY_FLAVOR} \
> > >>            www/py-beautifulsoup4${MODPY_FLAVOR}
> > >>
> > >> RUN_DEPENDS =        www/py-beautifulsoup4${MODPY_FLAVOR}
> > >
> > > beautifulsoup4 is an optional run dependency.
> > > py-lxml and py-chardet are required run depencies.
> > > None of them are needed at build time.
> >
> > bs4 should be a TDEP.
> >
> >
> > >
>
> Thanks. I'm not familiar with extras_require,install_requires and the
> whole python setup. OK with this dependencies?
>
> LIB_DEPENDS = textproc/libxml
>
> RUN_DEPENDS = textproc/py-chardet${MODPY_FLAVOR} \
> textproc/py-lxml${MODPY_FLAVOR}
>
> TEST_DEPENDS = www/py-beautifulsoup4${MODPY_FLAVOR}
>
ping, latest tarball attached for convenience.

py3-html5-parser-0.4.9p0.tar.gz (1K) Download Attachment