[Biopython] Biopython Digest, Vol 264, Issue 2
Ivan Erill
ivan.erill at gmail.com
Fri Apr 24 10:20:18 EDT 2026
Dear Peter,
I don't think there is a viable answer that does not go in the direction
you point towards (i.e. some sort of ban), if the aim is to keep manually
reviewing Biopython. It simply does not scale up. I believe the discussion
now for open-source projects like Biopython is whether to maintain manual
review or fully embrace AI both as contributor and maintainer/reviewer. I,
for one, would rather stick to manual review.
How to maintain manual review in an AI world, as Markus points out, is an
entirely different question. Some initiatives, like Vouch (
https://github.com/mitchellh/vouch) explore the old concept of a
web-of-trust. Simply pre-authorizing/vetting contributors via an old
fashioned mail list, rather than through github, may be an effective way to
screen out fully-automated PRs or PRs intended only to boost creds. It's
essentially a throttle question. Constrain too much and you'll lose some
quality contributions, open up too much and you'll get swamped. A fine line
to walk.
Incidentally, thank you and the main reviewer cadre for keeping Biopython
alive and kicking.
Ivan
On Fri, Apr 24, 2026 at 3:00 PM <biopython-request at biopython.org> wrote:
> Send Biopython mailing list submissions to
> biopython at biopython.org
>
> To subscribe or unsubscribe via the World Wide Web, visit
> https://mailman.open-bio.org/mailman/listinfo/biopython
> or, via email, send a message with subject or body 'help' to
> biopython-request at biopython.org
>
> You can reach the person managing the list at
> biopython-owner at biopython.org
>
> When replying, please edit your Subject line so it is more specific
> than "Re: Contents of Biopython digest..."
>
>
> Today's Topics:
>
> 1. Generative AI policy for contributions to Biopython (Peter Cock)
> 2. Re: Generative AI policy for contributions to Biopython
> (Peter Cock)
> 3. Re: Generative AI policy for contributions to Biopython
> (Markus Piotrowski)
> 4. Re: Generative AI policy for contributions to Biopython
> (Andrew Dalke)
>
>
> ----------------------------------------------------------------------
>
> Message: 1
> Date: Fri, 24 Apr 2026 11:16:09 +0100
> From: Peter Cock <p.j.a.cock at googlemail.com>
> To: "biopython at biopython.org" <biopython at biopython.org>
> Subject: [Biopython] Generative AI policy for contributions to
> Biopython
> Message-ID:
> <
> CAKVJ-_603iM02QN-ORLuGvkNjiL2YL0RLKp-h20FKmRMLrmK-Q at mail.gmail.com>
> Content-Type: text/plain; charset="UTF-8"
>
> Dear Biopythoneers,
>
> We need to set out a generative AI policy for contributions to Biopython.
>
> There are now multiple recent PRs submitted by new contributors which
> are openly using AI tools, more that I suspect are, and now even AI
> assisted
> PRs from past contributors (where CV padding or other external metrics
> are unlikely to be driving this). These are generally more work to review
> than human written PRs, and that is a growing issue.
>
> I blogged about my views late last year - ending in the line "Right now, I
> still lean very much to saying no any PR using generative AI".
>
>
> https://blastedbio.blogspot.com/2025/11/thoughts-on-generative-ai-contributions.html
>
> Things will change (both tool capabilities, but also the social and legal
> interpetations) but that post still describes my views today - note I did
> not touch on the topic of communications there (see below).
>
> Recently Linux adopted what has been described as a balanced stance
> treating it as a tool with very clear expectations that usage MUST be
> declared
> and that the human submitter is responsible for (quoting these four
> points):
>
> * Reviewing all AI-generated code
> * Ensuring compliance with licensing requirements
> * Adding their own Signed-off-by tag to certify the DCO
> * Taking full responsibility for the contribution
>
> https://docs.kernel.org/process/coding-assistants.html
>
> That is pragmatic but ignores the legal and ethical minefield. We don't
> have a Developer Certificate of Origin (DCO), but I think the other
> points are a bare minimum for any Biopython policy.
>
> Most of my personal open source projects have only had a very small
> number of contributors, and I am comfortable with outright rejecting
> generative AI. I know some of the past/current Biopython contributors
> are more willing to embrace this technology though - so I doubt support
> for a simple ban would be unanimous.
>
> Speaking for a moment as the current Open Bioinformatics Foundation
> president, the board has discussed this and agreed not to try to micro
> manage the member projects. For reference, BioPerl have started
> https://github.com/bioperl/bioperl-live/issues/407 which has some
> excellent points and examples to consider.
>
> In particular, this is not just a code or documentation changes issue - but
> also about the communication around any proposed change: the nature
> of the commit messages, pull request description, and discussion. This
> ties into the maintainers' burden - many of our recent AI generated PRs
> have fairy short code changes but the verbose text is exhausting to read
> and unhelpful. It has sometimes felt like I have been talking to an AI
> agent
> rather than a human - I actually liked the feeling of mentoring a new
> contributor and guiding them through minor hurdles to getting their
> change accepted, but you lose that with an AI agent inbetween you.
>
> I therefore very much like this line from the curreth Codeberg policy:
>
> > All communication, that includes: commit messages, pull request
> > messages, documentation, code comments and issues (and
> > comments on issues/pull requests), that is intended to be read
> > by people to understand your thoughts and work must not have
> > been generated with AI. We exclude machine translation and
> > tooling that helps with grammar and spelling check.
>
> https://codeberg.org/comaps/Governance/src/branch/main/AI_USAGE.md
>
> Would anyone like to speak in defence of accepting AI (assisted) PRs,
> and suggest an existing policy you would be happy we adopt or base
> ours on?
>
> Or should I start drafting a more draconian but likely much shorter one -
> a few lines like this in the CONTRIBUTING file and/or PR template: No
> generative AI to be used in any Biopython contributions, with the exception
> of machine translation to/from English (where you might consider including
> your original language text as well).
>
> Thank you,
>
> Peter
>
>
> ------------------------------
>
> Message: 2
> Date: Fri, 24 Apr 2026 11:26:28 +0100
> From: Peter Cock <p.j.a.cock at googlemail.com>
> To: "biopython at biopython.org" <biopython at biopython.org>
> Subject: Re: [Biopython] Generative AI policy for contributions to
> Biopython
> Message-ID:
> <CAKVJ-_59PsJ=
> oX_VGE6O20-qdcGhuVu7noFmOFO1Oqzk+qk8vg at mail.gmail.com>
> Content-Type: text/plain; charset="UTF-8"
>
> I just posted this on Mastodon (neither I nor Biopython or the OBF use
> X/Twitter
> anymore):
>
> https://fediscience.org/@pjacock/116143426085553346
>
> And that reminded me of an earlier remark I made:
>
> > Seems the recent #GenerativeAI #slop pull requests that I've looked at
> for
> > #Biopython have preferentially targeted the "Good First Issues". We
> really
> > wanted those to be onboarding ramps for new #OpenSource contributors -
> > and not for padding anyone's GitHub profile or whatever the motivation
> here is.
> >
> > So I think any formal policy will want to say explicitly #NoAI on those
> issues
> > at the very least.
>
> https://fediscience.org/@pjacock/116161804363016972
>
> Peter
>
> On Fri, Apr 24, 2026 at 11:16?AM Peter Cock <p.j.a.cock at googlemail.com>
> wrote:
> >
> > Dear Biopythoneers,
> >
> > We need to set out a generative AI policy for contributions to Biopython.
> >
> > There are now multiple recent PRs submitted by new contributors which
> > are openly using AI tools, more that I suspect are, and now even AI
> assisted
> > PRs from past contributors (where CV padding or other external metrics
> > are unlikely to be driving this). These are generally more work to review
> > than human written PRs, and that is a growing issue.
> >
> > I blogged about my views late last year - ending in the line "Right now,
> I
> > still lean very much to saying no any PR using generative AI".
> >
> >
> https://blastedbio.blogspot.com/2025/11/thoughts-on-generative-ai-contributions.html
> >
> > Things will change (both tool capabilities, but also the social and legal
> > interpetations) but that post still describes my views today - note I did
> > not touch on the topic of communications there (see below).
> >
> > Recently Linux adopted what has been described as a balanced stance
> > treating it as a tool with very clear expectations that usage MUST be
> declared
> > and that the human submitter is responsible for (quoting these four
> points):
> >
> > * Reviewing all AI-generated code
> > * Ensuring compliance with licensing requirements
> > * Adding their own Signed-off-by tag to certify the DCO
> > * Taking full responsibility for the contribution
> >
> > https://docs.kernel.org/process/coding-assistants.html
> >
> > That is pragmatic but ignores the legal and ethical minefield. We don't
> > have a Developer Certificate of Origin (DCO), but I think the other
> > points are a bare minimum for any Biopython policy.
> >
> > Most of my personal open source projects have only had a very small
> > number of contributors, and I am comfortable with outright rejecting
> > generative AI. I know some of the past/current Biopython contributors
> > are more willing to embrace this technology though - so I doubt support
> > for a simple ban would be unanimous.
> >
> > Speaking for a moment as the current Open Bioinformatics Foundation
> > president, the board has discussed this and agreed not to try to micro
> > manage the member projects. For reference, BioPerl have started
> > https://github.com/bioperl/bioperl-live/issues/407 which has some
> > excellent points and examples to consider.
> >
> > In particular, this is not just a code or documentation changes issue -
> but
> > also about the communication around any proposed change: the nature
> > of the commit messages, pull request description, and discussion. This
> > ties into the maintainers' burden - many of our recent AI generated PRs
> > have fairy short code changes but the verbose text is exhausting to read
> > and unhelpful. It has sometimes felt like I have been talking to an AI
> agent
> > rather than a human - I actually liked the feeling of mentoring a new
> > contributor and guiding them through minor hurdles to getting their
> > change accepted, but you lose that with an AI agent inbetween you.
> >
> > I therefore very much like this line from the curreth Codeberg policy:
> >
> > > All communication, that includes: commit messages, pull request
> > > messages, documentation, code comments and issues (and
> > > comments on issues/pull requests), that is intended to be read
> > > by people to understand your thoughts and work must not have
> > > been generated with AI. We exclude machine translation and
> > > tooling that helps with grammar and spelling check.
> >
> > https://codeberg.org/comaps/Governance/src/branch/main/AI_USAGE.md
> >
> > Would anyone like to speak in defence of accepting AI (assisted) PRs,
> > and suggest an existing policy you would be happy we adopt or base
> > ours on?
> >
> > Or should I start drafting a more draconian but likely much shorter one -
> > a few lines like this in the CONTRIBUTING file and/or PR template: No
> > generative AI to be used in any Biopython contributions, with the
> exception
> > of machine translation to/from English (where you might consider
> including
> > your original language text as well).
> >
> > Thank you,
> >
> > Peter
>
>
> ------------------------------
>
> Message: 3
> Date: Fri, 24 Apr 2026 13:43:39 +0200
> From: Markus Piotrowski <Markus.Piotrowski at ruhr-uni-bochum.de>
> To: biopython at biopython.org
> Subject: Re: [Biopython] Generative AI policy for contributions to
> Biopython
> Message-ID: <a48519d2-8a7f-45cf-a6e6-3bb0b02fc80a at ruhr-uni-bochum.de>
> Content-Type: text/plain; charset=UTF-8; format=flowed
>
> Dear Peter,
>
> While I also tend to strongly restrict or even forbid the usage of AI in
> Biopython, I wonder how you really can prevent this. A careful submitter
> can mask the AI signs in his/her code so that I will go undetected. So
> wouldn't it be better to allow the usage under strong restrictions and
> conditions (and I agree with the those that you have mentioned,
> including the "good first issues" in your other e-mail) to encourage
> potential contributors to be transparent about the of use of AI?
>
> I'm unsure about this, but I wanted to include this topic into the
> discussion.
>
> Best
> Markus
>
> Am 24.04.2026 um 12:16 schrieb Peter Cock:
> > Dear Biopythoneers,
> >
> > We need to set out a generative AI policy for contributions to Biopython.
> >
> > There are now multiple recent PRs submitted by new contributors which
> > are openly using AI tools, more that I suspect are, and now even AI
> assisted
> > PRs from past contributors (where CV padding or other external metrics
> > are unlikely to be driving this). These are generally more work to review
> > than human written PRs, and that is a growing issue.
> >
> > I blogged about my views late last year - ending in the line "Right now,
> I
> > still lean very much to saying no any PR using generative AI".
> >
> >
> https://blastedbio.blogspot.com/2025/11/thoughts-on-generative-ai-contributions.html
> >
> > Things will change (both tool capabilities, but also the social and legal
> > interpetations) but that post still describes my views today - note I did
> > not touch on the topic of communications there (see below).
> >
> > Recently Linux adopted what has been described as a balanced stance
> > treating it as a tool with very clear expectations that usage MUST be
> declared
> > and that the human submitter is responsible for (quoting these four
> points):
> >
> > * Reviewing all AI-generated code
> > * Ensuring compliance with licensing requirements
> > * Adding their own Signed-off-by tag to certify the DCO
> > * Taking full responsibility for the contribution
> >
> > https://docs.kernel.org/process/coding-assistants.html
> >
> > That is pragmatic but ignores the legal and ethical minefield. We don't
> > have a Developer Certificate of Origin (DCO), but I think the other
> > points are a bare minimum for any Biopython policy.
> >
> > Most of my personal open source projects have only had a very small
> > number of contributors, and I am comfortable with outright rejecting
> > generative AI. I know some of the past/current Biopython contributors
> > are more willing to embrace this technology though - so I doubt support
> > for a simple ban would be unanimous.
> >
> > Speaking for a moment as the current Open Bioinformatics Foundation
> > president, the board has discussed this and agreed not to try to micro
> > manage the member projects. For reference, BioPerl have started
> > https://github.com/bioperl/bioperl-live/issues/407 which has some
> > excellent points and examples to consider.
> >
> > In particular, this is not just a code or documentation changes issue -
> but
> > also about the communication around any proposed change: the nature
> > of the commit messages, pull request description, and discussion. This
> > ties into the maintainers' burden - many of our recent AI generated PRs
> > have fairy short code changes but the verbose text is exhausting to read
> > and unhelpful. It has sometimes felt like I have been talking to an AI
> agent
> > rather than a human - I actually liked the feeling of mentoring a new
> > contributor and guiding them through minor hurdles to getting their
> > change accepted, but you lose that with an AI agent inbetween you.
> >
> > I therefore very much like this line from the curreth Codeberg policy:
> >
> >> All communication, that includes: commit messages, pull request
> >> messages, documentation, code comments and issues (and
> >> comments on issues/pull requests), that is intended to be read
> >> by people to understand your thoughts and work must not have
> >> been generated with AI. We exclude machine translation and
> >> tooling that helps with grammar and spelling check.
> > https://codeberg.org/comaps/Governance/src/branch/main/AI_USAGE.md
> >
> > Would anyone like to speak in defence of accepting AI (assisted) PRs,
> > and suggest an existing policy you would be happy we adopt or base
> > ours on?
> >
> > Or should I start drafting a more draconian but likely much shorter one -
> > a few lines like this in the CONTRIBUTING file and/or PR template: No
> > generative AI to be used in any Biopython contributions, with the
> exception
> > of machine translation to/from English (where you might consider
> including
> > your original language text as well).
> >
> > Thank you,
> >
> > Peter
> > _______________________________________________
> > Biopython mailing list - Biopython at biopython.org
> > https://mailman.open-bio.org/mailman/listinfo/biopython
>
>
>
> ------------------------------
>
> Message: 4
> Date: Fri, 24 Apr 2026 14:53:15 +0200
> From: Andrew Dalke <dalke at dalkescientific.com>
> To: Markus Piotrowski <Markus.Piotrowski at ruhr-uni-bochum.de>
> Cc: biopython at biopython.org
> Subject: Re: [Biopython] Generative AI policy for contributions to
> Biopython
> Message-ID: <A18E3F32-46C0-4AFC-8092-D18DA3B47E86 at dalkescientific.com>
> Content-Type: text/plain; charset=us-ascii
>
> On Apr 24, 2026, at 13:43, Markus Piotrowski <
> Markus.Piotrowski at ruhr-uni-bochum.de> wrote:
> > I wonder how you really can prevent this.
>
> The same way Biopython prevents someone from contributing code without
> permission of the copyright holder, or code which implements an algorithm
> under patent protection - say "no", be aware that people may do it, and if
> that happens and is found out, do what can be done to remove it.
>
>
> > Am 24.04.2026 um 12:16 schrieb Peter Cock:
> >> I blogged about my views late last year - ending in the line "Right
> now, I
> >> still lean very much to saying no any PR using generative AI".
>
> I'm one of those contributors, leaders, and maintainers who have gone and
> went. I know I have no say at all about the project direction.
>
> Given what little it's worth, I support the no-AI position.
>
> I long ago moved over to cheminformatics. I've had the displeasure of
> seeing AI-generated code bases in my field. One was clearly using AI for
> not-actually "clean room" rewrite of an existing code base, and falsely
> believed that doing so removed copyright protection. Another contained code
> obviously derived from an open source code base, but without the required
> attribution. "Obvious" to someone like me, who has looked at the original
> source and alternative implementations, but the novice developer using AI
> tooling likely didn't realize it was plagiarized.
>
> Saying "the human submitter is responsible" assumes the contributor
> understands what it means to be responsible. As a general rule, scientists
> are not trained in the nuances of copyright law, nor have the time to
> evaluate the severe negative effects of these code generation tools while
> bombarded by "use AI!" boosterism, all under pressure of writing a thesis
> or paper.
>
> Some 20-odd years ago, when I helped manage the BOSC talk submission
> process, I would get submissions saying something like "code available for
> academic use only". I explained why that wasn't open source, and most
> (all?) changed their licenses to an actual open source license. They simply
> weren't aware, and needed guidance.
>
> Saying "No generative AI", and following up on it, is also a way to make
> people aware, and provide guidance.
>
> Andrew
> dalke at dalkescientific.com
>
>
>
>
>
> ------------------------------
>
> Subject: Digest Footer
>
> _______________________________________________
> Biopython mailing list - Biopython at biopython.org
> https://mailman.open-bio.org/mailman/listinfo/biopython
>
>
> ------------------------------
>
> End of Biopython Digest, Vol 264, Issue 2
> *****************************************
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://mailman.open-bio.org/pipermail/biopython/attachments/20260424/e50d8e9b/attachment-0001.htm>
More information about the Biopython
mailing list