如何从单独的脚本访问文档字符串？

Question

Building a GUI for users to select Python scripts they want to run.为用户构建他们想要运行的 select Python 脚本的 GUI。 Each script has its own docstring explaining inputs and outputs for the script.每个脚本都有自己的文档字符串来解释脚本的输入和输出。 I want to display that information in the UI once they've highlighted the script, but not selected to run it, and I can't seem to get access to the docstrings from the base program.一旦他们突出显示了脚本，但没有选择运行它，我想在 UI 中显示该信息，而且我似乎无法从基本程序访问文档字符串。

ex.前任。

test.py

"""this is a docstring"""
print('hello world')

program.py

index is test.py for this example, but is normally not known because it's whatever the user has selected in the GUI.在本例中index是test.py ，但通常不知道，因为它是用户在 GUI 中选择的任何内容。

# index is test.py
def on_selected(self, index):
    script_path = self.tree_view_model.filePath(index)
    fparse = ast.parse(''.join(open(script_path)))
    self.textBrowser_description.setPlainText(ast.get_docstring(fparse))

Answer 1

Let's the docstring you want to access belongs to the file, file.py .让我们要访问的文档字符串属于文件file.py 。

You can get the docstring by doing the following:您可以通过执行以下操作获取文档字符串：

import file
print(file.__doc__)

If you want to get the docstring before you import it then the you could read the file and extract the docstring.如果您想在导入之前获取文档字符串，那么您可以读取文件并提取文档字符串。 Here is an example:这是一个例子：

import re


def get_docstring(file)
    with open(file, "r") as f:
        content = f.read()  # read file
        quote = content[0]  # get type of quote
        pattern = re.compile(rf"^{quote}{quote}{quote}[^{quote}]*{quote}{quote}{quote}")  # create docstring pattern
    return re.findall(pattern, content)[0][3:-3]  # return docstring without quotes


print(get_docstring("file.py"))

Note: For this regex to work the docstring will need to be at the very top.注意：要使此正则表达式起作用，文档字符串需要位于最顶部。

Answer 2

Here's how to get it via importlib .这是通过importlib获取它的方法。 Most of the logic has been put in a function.大部分逻辑都放在了 function 中。 Note that using importlib does import the script (which causes all its top-level statements to be executed), but the module itself is discarded when the function returns.请注意，使用importlib确实会导入脚本（这会导致执行其所有顶级语句），但是当 function 返回时，模块本身会被丢弃。

If this was the script docstring_test.py in the current directory that I wanted to get the docstring from:如果这是我想从中获取文档字符串的当前目录中的脚本docstring_test.py ：

""" this is a multiline
    docstring.
"""
print('hello world')

Here's how to do it:这是如何做到的：

import importlib.util

def get_docstring(script_name, script_path):
    spec = importlib.util.spec_from_file_location(script_name, script_path)
    foo = importlib.util.module_from_spec(spec)
    spec.loader.exec_module(foo)
    return foo.__doc__

if __name__ == '__main__':

    print(get_docstring('docstring_test', "./docstring_test.py"))

Output: Output：

hello world
 this is a multiline
    docstring.

Update:更新：

Here's how to do it by letting the ast module in the standard library do the parsing which avoids both importing/executing the script as well as trying to parse it yourself with a regex.这是通过让标准库中的ast模块进行解析来避免导入/执行脚本以及尝试使用正则表达式自己解析它的方法。

This looks more-or-less equivalent to what's in your question, so it's unclear why what you have isn't working for you.这看起来或多或少等同于您的问题，因此尚不清楚为什么您所拥有的对您不起作用。

import ast

def get_docstring(script_path):
    with open(script_path, 'r') as file:
        tree = ast.parse(file.read())
        return ast.get_docstring(tree, clean=False)

if __name__ == '__main__':

    print(repr(get_docstring('./docstring_test.py')))

Output: Output：

' this is a multiline\n    docstring.\n'

如何从单独的脚本访问文档字符串？

问题描述

2 个解决方案

解决方案1
2 2021-03-01 22:41:00

解决方案2
1 已采纳 2021-03-01 23:34:21

如何从单独的脚本访问文档字符串？

问题描述

2 个解决方案

解决方案1 2 2021-03-01 22:41:00

解决方案2 1 已采纳 2021-03-01 23:34:21

解决方案1
2 2021-03-01 22:41:00

解决方案2
1 已采纳 2021-03-01 23:34:21