<html><head><title>The Dormouse's story</title></head>
<body>
<p class="title" name="dromouse"><b>The Dormouse's story</b></p>
<p class="story">Once upon a time there were three little sisters; and their names were
<a href="http://example.com/elsie" class="sister" id="link1"><!-- Elsie --></a>,
<a href="http://example.com/lacie" class="sister" id="link2">Lacie</a> and
<a href="http://example.com/tillie" class="sister" id="link3">Tillie</a>;
and they lived at the bottom of a well.</p>
<p class="story">...</p>
</body></html>
Create a BeautifulSoup object:
>>> from bs4 import BeautifulSoup
>>> html = """
... <html>
... <head>
... <title>The Dormouse's story</title>
... </head>
... <body>
... <p class="title"><b>The Dormouse's story</b></p>
...
... <p class="story">Once upon a time there were three little sisters; and their names were
... <a href="http://example.com/elsie" class="sister" id="link1">Elsie</a>,
... <a href="http://example.com/lacie" class="sister" id="link2">Lacie</a> and
... <a href="http://example.com/tillie" class="sister" id="link3">Tillie</a>;
... and they lived at the bottom of a well.
... </p>
...
... <p class="story">...</p>
... </body>
... </html>
... """
>>> # Name the parser explicitly; otherwise bs4 warns that no parser was specified.
>>> soup = BeautifulSoup(html, "html.parser")
>>> for link in soup.find_all('a'):
...     print(link.get('href'))
...
http://example.com/elsie
http://example.com/lacie
http://example.com/tillie
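To make clearer what the loop above is doing, here is a minimal sketch of the same link extraction using only Python 3's standard library. It uses html.parser instead of BeautifulSoup, so treat it as an illustration of the idea, not as how bs4 works internally. The names LinkCollector, extract_links, and SAMPLE are made up for this example.

```python
from html.parser import HTMLParser


class LinkCollector(HTMLParser):
    """Collect the href attribute of every <a> start tag (hypothetical helper)."""

    def __init__(self):
        super().__init__()
        self.hrefs = []

    def handle_starttag(self, tag, attrs):
        # attrs is a list of (name, value) pairs; value may be None.
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value is not None:
                    self.hrefs.append(value)


# A shortened stand-in for the "three sisters" sample document.
SAMPLE = """<p class="story">
<a href="http://example.com/elsie" class="sister" id="link1">Elsie</a>,
<a href="http://example.com/lacie" class="sister" id="link2">Lacie</a> and
<a href="http://example.com/tillie" class="sister" id="link3">Tillie</a>
</p>"""


def extract_links(html):
    """Return all href values found in <a> tags, in document order."""
    parser = LinkCollector()
    parser.feed(html)
    parser.close()
    return parser.hrefs


if __name__ == "__main__":
    for href in extract_links(SAMPLE):
        print(href)
```

BeautifulSoup is still the more convenient choice for real pages, since it copes with broken markup and offers richer queries.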
What if I want to scrape all of the text?
Then use the following command:
>>> print(soup.get_text())
The Dormouse's story
The Dormouse's story
Once upon a time there were three little sisters; and their names were
Elsie,
Lacie and
Tillie;
and they lived at the bottom of a well.
...
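Next, let's write a simple crawler that calls 站长帮手 (i.links.cn) to build a subdomain lookup tool.
First, capture the request and analyze it; the tool used here is Burp.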
POST /subdomain/ HTTP/1.1
Host: i.links.cn
Content-Length: 34
Cache-Control: max-age=0
Origin: http://i.links.cn
[remaining headers, cookies, and form body not recoverable]
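A captured request like this one can be rebuilt in code. Below is a minimal Python 3 sketch using only the standard library; it constructs the POST request but does not send it. The http scheme, the form field names (domain, b2, b3, b4), and the target ichunqiu.com are assumptions made for illustration, chosen so that the encoded body comes to 34 bytes, the Content-Length shown in the capture.

```python
from urllib.parse import urlencode
from urllib.request import Request

# Host and path come from the captured request; the http scheme is assumed.
URL = "http://i.links.cn/subdomain/"


def build_request(domain):
    """Build (but do not send) the subdomain-lookup POST request."""
    # Field names are hypothetical; check them against your own capture.
    form = {"domain": domain, "b2": 1, "b3": 1, "b4": 1}
    body = urlencode(form).encode("ascii")
    headers = {"Content-Type": "application/x-www-form-urlencoded"}
    # Supplying data makes urllib issue a POST instead of a GET.
    return Request(URL, data=body, headers=headers)


req = build_request("ichunqiu.com")

if __name__ == "__main__":
    print(req.get_method(), req.full_url)
    print(len(req.data), req.data)
```

Passing req to urllib.request.urlopen would actually send it; the response HTML could then be handed to BeautifulSoup as in the earlier examples.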
Traceback (most recent call last):
  File "demo.py", line 8, in <module>
    print r.text
UnicodeEncodeError: 'gbk' codec can't encode character u'\xcf' in position 386: illegal multibyte sequence
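A likely cause of this error, assuming the page is served as GBK/GB2312 without declaring a charset, is that the body gets decoded as ISO-8859-1. That produces characters such as u'\xcf' that a GBK console cannot print. The Python 3 sketch below reproduces the effect with the standard library only; the sample string and the helper can_encode are made up for this example.

```python
# Stand-in for a response body served as GBK (the sample text is made up).
original = "子域名查询"
raw = original.encode("gbk")

# Wrong: decode the bytes as ISO-8859-1, a common fallback when no charset is declared.
wrong = raw.decode("latin-1")

# Right: decode with the charset the page actually uses.
right = raw.decode("gbk")


def can_encode(text, codec):
    """Return True if text can be written out using codec."""
    try:
        text.encode(codec)
        return True
    except UnicodeEncodeError:
        return False


if __name__ == "__main__":
    print(can_encode(wrong, "gbk"))  # mojibake cannot go to a GBK console
    print(can_encode(right, "gbk"))  # correctly decoded text can
    # Last-resort workaround: replace unencodable characters instead of crashing.
    print(wrong.encode("gbk", errors="replace"))
```

With requests, the usual equivalent is to set r.encoding (for example to r.apparent_encoding) before reading r.text, so the text is decoded correctly in the first place.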