bitbucket.org/fpz/brown/dom | Package dom seriously needs to be cleaned up at some point. |
bitbucket.org/fpz/dom/htmldom | Package htmldom is a simple wrapper around the golang.org/x/net/html structures to implement the dom interfaces. |
bitbucket.org/zaphar/go-html-transform/css/selector | Package selector contains a css3 selector parser. |
bitbucket.org/zaphar/go-html-transform/h5 | Package h5 implements a wrapper and DSL for golang.org/x/net/html. |
bitbucket.org/zaphar/go-html-transform/html/transform | Package transform implements a html css selector and transformer. |
bosun.org/cmd/scollector/collectors | |
gist.github.com/5295453.git | Parsing XML export of wordpress blog for converting to common blog format. |
github.com/300brand/coverage/article/body | |
github.com/300brand/coverage/article/filter | These always return false. |
github.com/3ft9/GoOse | Html Content / Article Extractor in Golang |
github.com/42wim/bm | |
github.com/9uuso/sanitize | Package sanitize provides functions for sanitizing text. |
github.com/aarzilli/sandblast | Library that uses Readability-like heuristics to extract text from an HTML document |
github.com/aarzilli/sandblast/example | |
github.com/aarzilli/sandblast/qa | |
github.com/abiosoft/GoOse | Html Content / Article Extractor in Golang |
github.com/advancedlogic/GoOse | Html Content / Article Extractor in Golang |
github.com/advancedlogic/goquery | Package goquery implements features similar to jQuery, including the chainable syntax, to manipulate and query an HTML document. |
github.com/agl/xmpp-client | |
github.com/agnivade/chatwebsocket | |
github.com/aishraj/gowler | A toy web crawler written in go |
github.com/amauragis/sanitize | Package sanitize provides functions for sanitizing text. |
github.com/Anaminus/rbxweb | The rbxweb package provides an interface to many of ROBLOX's web-based services. |
github.com/andlabs/ohv | 7 october 2014 |
github.com/andrew-d/goscrape/extract | |
github.com/andybalholm/cascadia | The cascadia package is an implementation of CSS selectors. |
github.com/andybalholm/redwood | Redwood is an internet content-filtering program. |
github.com/andyleap/microformats | Microformats V2 parser library |
github.com/AndyNortrup/TweetHarvest | A simple Go Map-Reduce system that queries twitter for key words, pulls addresses from the tweets and produces an RSS feed with the most popular links to be read later. |
github.com/antiphy/cascadia | The cascadia package is an implementation of CSS selectors. |
github.com/AntoineAugusti/crawler/fetchers | |
github.com/apoydence/hydra/examples/webCrawler | |
github.com/appc/acbuild/Godeps/_workspace/src/github.com/appc/spec/discovery | Package discovery contains an experimental implementation of the Image Discovery section of the appc specification. |
github.com/appc/spec/discovery | Package discovery contains an experimental implementation of the Image Discovery section of the appc specification. |
github.com/appc/spec/Godeps/_workspace/src/golang.org/x/net/html/charset | Package charset provides common text encodings for HTML documents. |
github.com/aranair/gosnap/crawler | |
github.com/arnehormann/hck | HTML construction kit |
github.com/aymerick/douceur/inliner | |
github.com/bcampbell/arts/arts | |
github.com/bcampbell/arts/discover | |
github.com/bcampbell/arts/scrapetool | |
github.com/belak/seabird-plugins | The default set of plugins for the seabird bot framework |
github.com/belak/seabird-plugins/linkproviders | |
github.com/benlemasurier/crawler | |
github.com/BenLubar/htmlcleaner | Scrub that HTML clean! |
github.com/bikbah/doc-converter | |
github.com/biogo/ncbi/blast | Package blast provides support for interaction with the NCBI BLAST service. |
github.com/bkaradzic/GoOse | Html Content / Article Extractor in Golang |
github.com/booyaa/streaker/ghcontribs | |
github.com/bslatkin/tweeps2opml | |
github.com/caesarxuchao/kubernetes/pkg/util/proxy | Package proxy provides transport and upgrade support for proxies |
github.com/cdjj/goread | google reader clone in go on app engine |
github.com/cdjj/goread/sanitizer | |
github.com/chai2010/gopkg/encoding/markdown | Package markdown implements markdown processor. |
github.com/chrisdambrosio/sanitize | Package sanitize provides functions for sanitizing text. |
github.com/chzyer/pocket | |
github.com/cleesmith/golang_learning/webscraper | how to run: go run webscrape.go http://cleesmith.github.io/ go run webscrape.go http://cellipede.com/ http://cleesmith.github.io/ |
github.com/client9/gospell/plaintext | |
github.com/commonsense-org/html2text | Golang HTML to plaintext conversion library |
github.com/coreos/dex/pkg/html | |
github.com/coreos/rkt/Godeps/_workspace/src/github.com/appc/spec/discovery | Package discovery contains an experimental implementation of the Image Discovery section of the appc specification. |
github.com/coreos/rkt/Godeps/_workspace/src/golang.org/x/net/html/charset | Package charset provides common text encodings for HTML documents. |
github.com/cpalone/maimai | |
github.com/cpalone/maimai.v2 | Version 2 of the MaiMai bot for euphoria.io, using the gobot framework. |
github.com/crackcomm/go-core/html/selector | |
github.com/daikikohara/enotify-slack/event | Package event provides implementations of getting events from each event provider. |
github.com/danverbraganza/shortlink/fetcher | |
github.com/datasektionen/taitan/anchor | |
github.com/davidscholberg/go-urbandict | Package urbandict provides a Go wrapper for the Urban Dictionary REST API. |
github.com/davidscholberg/irkbot/lib/modules/modpm | |
github.com/dchest/htmlmin | Package htmlmin minifies HTML. |
github.com/dchest/static-search/indexer | |
github.com/ddo/go-crawler | just an awesome crawler in go |
github.com/ddo/pick | pick stuff from html |
github.com/denischevalier/crawler | Package crawler brings interface and structs to crawl amongst websites. |
github.com/Depado/go-b0tsec/plugins/title | |
github.com/deweerdt/goquery | Package goquery implements features similar to jQuery, including the chainable syntax, to manipulate and query an HTML document. |
github.com/DigiExam/bluemonday | Package bluemonday provides a way of describing a whitelist of HTML elements and attributes as a policy, and for that policy to be applied to untrusted strings from users that may contain markup. |
github.com/DimShadoWWW/goquery | Package goquery implements features similar to jQuery, including the chainable syntax, to manipulate and query an HTML document. |
github.com/Doist/unfurlist | Package unfurlist implements a service that unfurls URLs and provides more information about them. |
github.com/draxil/node_template | node_template extends go.net/html & cascadia to make it as easy as possible to process pure HTML templates. |
github.com/drewwells/codepen | Currentbrowsers package attempts to find the most recent versions of popular browsers. |
github.com/Drpsycho/goquery | Package goquery implements features similar to jQuery, including the chainable syntax, to manipulate and query an HTML document. |
github.com/dsymonds/opal | Package opal provides programmatic access to Opal card information. |
github.com/dvyukov/go-fuzz/examples/html | |
github.com/dyatlov/go-htmlinfo/htmlinfo | |
github.com/dyatlov/go-opengraph/examples | |
github.com/dyatlov/go-opengraph/opengraph | |
github.com/dyatlov/go-readability | A Go implementation of the readability algorithm by arc90 labs |
github.com/eduncan911/sanitize | Package sanitize provides functions for sanitizing text. |
github.com/ericchiang/css | Package css implements CSS selector HTML search. |
github.com/ericchiang/css/example | |
github.com/ericchiang/scrape | Package scrape provides a searching api on top of golang.org/x/net/html |
github.com/ericchiang/scrape/example | |
github.com/ernesto-jimenez/scraperboard | Golang library to easily scrape websites based on simple XML declarations |
github.com/etherealmachine/markdown | A markdown parser library for Go |
github.com/fanyang01/crawler | Package crawler provides a flexible web crawler. |
github.com/fanyang01/crawler/fingerprint | |
github.com/felixrabe-go/html | Simplified HTML parsing |
github.com/fern4lvarez/go-metainspector/metainspector | go-metainspector is a web scraper package that provides access to basic info and meta tags of a given URL. |
github.com/flexiant/concerto/setup | |
github.com/frizinak/tiif | |
github.com/Garoth/weekary | |
github.com/get3w/get3w/storage | |
github.com/getlantern/aws-sdk-go/model | Package model contains functionality to generate clients for AWS APIs. |
github.com/getmillipede/millipede-go/vendor/github.com/kortschak/zalgo | Package zalgo implements a zalgo text io.Writer. |
github.com/getmillipede/millipede-go/vendor/golang.org/x/net/html/charset | Package charset provides common text encodings for HTML documents. |
github.com/gizak/docor | A simple and stupid one-page offline html doc/report generator using pure Go |
github.com/go-gitea/gitea/modules/base | |
github.com/gogits/gogs/modules/markdown | |
github.com/google/http2preload/generate | Package generate parses HTML and extracts assets suitable for constructing http2preload.Manifest. |
github.com/gophergala/ging/docindex | |
github.com/gopkg/encoding/markdown | Package markdown implements markdown processor. |
github.com/GoWiki/GoWiki | GoWiki document |
github.com/GoWiki/greentuesday | greentuesday document |
github.com/go-xiaohei/htmldown | html to markdown |
github.com/Go-zh/tools/cmd/html2article | This program takes an HTML file and outputs a corresponding article file in present format. |
github.com/grahamking/xmlpath | Package xmlpath implements a strict subset of the XPath specification for the Go language. |
github.com/groob/vquery/axiom | |
github.com/h2object/h2object/app | |
github.com/h2object/h2object/page | |
github.com/hariharan-uno/extract | Package extract provides simple methods for extracting specific items from a web page. |
github.com/hashicorp/aws-sdk-go/model | Package model contains functionality to generate clients for AWS APIs. |
github.com/hobeone/go-html-transform/css/selector | Package selector contains a css3 selector parser. |
github.com/hobeone/go-html-transform/h5 | Package h5 implements a wrapper and DSL for golang.org/x/net/html. |
github.com/hobeone/go-html-transform/html/transform | Package transform implements a html css selector and transformer. |
github.com/hobeone/rss2go/feed | |
github.com/hschaeidt/domquery | Lightweight Go implementation for querying/searching through HTML DOMs. |
github.com/hschaeidt/domquery/tokenutil | |
github.com/ianremmler/taplist | |
github.com/ideamean/goquery | Package goquery implements features similar to jQuery, including the chainable syntax, to manipulate and query an HTML document. |
github.com/imdario/zas | |
github.com/indraniel/github_flavored_markdown | Package github_flavored_markdown provides a GitHub Flavored Markdown renderer with fenced code block highlighting, clickable header anchor links. |
github.com/ioriiod0/go-readability | A Go implementation of the readability algorithm by arc90 labs |
github.com/isislovecruft/xmpp-client | |
github.com/JackBister/raidbot | |
github.com/JackDanger/collectlinks | This package does the extraordinarily simple operation of parsing a given piece of html and providing you with all the hyperlinks hrefs it finds. |
github.com/jamiealquiza/tim-mcrawl | |
github.com/jaytaylor/html2text | Golang HTML to plaintext conversion library |
github.com/jboursiquot/scriptsources | Package scriptsources searches through HTML for script tags and returns the URLs found in the src attributes. |
github.com/jcgregorio/piccolo | |
github.com/jcgregorio/piccolo/piccolo | |
github.com/jcrussell/tvtccal | |
github.com/jimmysawczuk/recon | Package recon scrapes URLs for OpenGraph information. |
github.com/jisaw/ISU-Salary-Scrape | |
github.com/jkamenik/crawler | |
github.com/jloup/html/nodedata | |
github.com/jmoiron/goquery | Package goquery implements features similar to jQuery, including the chainable syntax, to manipulate and query an HTML document. |
github.com/joelanders/xmpp-client | |
github.com/joffrey-bion/gosoup | GoSoup allows to parse HTML content and browse the produced tree. |
github.com/johannesboyne/apex/wiki | Package wiki implements a simple GitHub wiki miner and output formatter. |
github.com/johnweldon/go/html | |
github.com/johnweldon/go/web | |
github.com/joseluiscd/gosoup | A helper to explore the DOM of an HTML file. |
github.com/joshheinrichs/mumble-jukebox | |
github.com/joshuaprunier/trite | |
github.com/jpillora/goquery | Package goquery implements features similar to jQuery, including the chainable syntax, to manipulate and query an HTML document. |
github.com/jprobinson/eazye | The gangsta gangsta way to pull email |
github.com/jprobinson/newshound/fetch | |
github.com/jsvana/seabird/plugins | |
github.com/juju/httprequest | Package httprequest provides functionality for unmarshaling HTTP request parameters into a struct type. |
github.com/julianec/justanotherircbot | |
github.com/justintan/gox/html | |
github.com/kafkaliu/gocrawl | gocrawl is a polite, slim and concurrent web crawler written in Go. |
github.com/kardianos/spider | |
github.com/karlek/nyfiken/page | Package page contains functions which checks if a page has been updated. |
github.com/karlek/nyfiken/strip | Package strip containts functions to remove false positives from comparisons of new and last scrape. |
github.com/Kemonozume/nzbcrawler/crawler/ghost | |
github.com/kennygrant/sanitize | Package sanitize provides functions for sanitizing text. |
github.com/keybase/client/go/client | this command is only for testing purposes |
github.com/keysolutions/playagain/htmlutil | |
github.com/keysolutions/playagain/show | |
github.com/kitschysynq/befulnes | Package befulnes provides tools for generating and working with lists of nonsense words |
github.com/klauspost/intrinsics | |
github.com/koffeinsource/kaffeeshare/extract/plugins | |
github.com/Komosa/cf | |
github.com/kortschak/zalgo | Package zalgo implements a zalgo text io.Writer. |
github.com/korya/html2text | Golang HTML to plaintext conversion library |
github.com/kubernetes/kubernetes/pkg/util/proxy | Package proxy provides transport and upgrade support for proxies |
github.com/kuun/goread | google reader clone in go on app engine |
github.com/kuun/goread/sanitizer | |
github.com/laher/opengraph | Package opengraph extracts Open Graph metadata from html documents. |
github.com/lestrrat/go-mechanize | WWW::Mechanize for Go |
github.com/lestrrat/go-mechanize/query | |
github.com/linkosmos/tokeq | Match & Callback iterator for html.Node - aimed for performance & HTML document reusability |
github.com/linkosmos/tokeq/example | |
github.com/linkosmos/tokeq/nodes | |
github.com/linuxerwang/goats-html | goats-html is an atribute-based template system specialized for HTML. |
github.com/linuxerwang/goats-html/processors | |
github.com/localvar/makeepub | makeepub document |
github.com/loic-fejoz/microformat-golang-toolbox | A Golang microformats 2 parser |
github.com/lucy/tewibot/modules/net/urltitle | |
github.com/Luminarys/gochat/modules | |
github.com/mantyr/goquery | Package goquery implements features similar to jQuery, including the chainable syntax, to manipulate and query an HTML document. |
github.com/marcosinger/GoOse | Html Content / Article Extractor in Golang |
github.com/mars9/crawler | Package crawler provides a crawler implementation. |
github.com/Masterminds/engine/form | Package form provides utilities for creating and accessing HTML forms. |
github.com/mattn/rebuildfm | |
github.com/mauidude/go-readability | A Go implementation of the readability algorithm by arc90 labs |
github.com/maxymania/scrapland/htmlscrape | This is an utility library for use with golang.org/x/net/html. |
github.com/maxymania/scrapland/webscrape | |
github.com/mb0/vmail/feeds | |
github.com/mcoffin/mcoffinbot | |
github.com/mdevilliers/kubernetes/pkg/util/proxy | Package proxy provides transport and upgrade support for proxies |
github.com/mdigger/md2epub | |
github.com/mechanoid/goquery | Package goquery implements features similar to jQuery, including the chainable syntax, to manipulate and query an HTML document. |
github.com/meoow/nodefinder | This library finds all nodes in html file given specific path. |
github.com/meoow/nodefinder/cmd | |
github.com/MessageDream/drift/modules/base | |
github.com/mewkiz/pkg/htmlutil | Package htmlutil implements some html utility functions. |
github.com/mewkiz/pkg/httputil | Package httputil implements some http utility functions. |
github.com/mewmew/playground/cmd/links | links examines HTML files and reports invalid links. |
github.com/michigan-com/newsfetch/extraction/recipe_parsing | |
github.com/microcosm-cc/bluemonday | Package bluemonday provides a way of describing a whitelist of HTML elements and attributes as a policy, and for that policy to be applied to untrusted strings from users that may contain markup. |
github.com/microcosm-cc/microcosm/models | Package models provides interfaces to perform CRUD actions to underlying data and a LRU cache layer for data returned. |
github.com/mikecb/goread | google reader clone in go on app engine |
github.com/mikecb/goread/sanitizer | |
github.com/mikedanese/kubernetes/pkg/util/proxy | Package proxy provides transport and upgrade support for proxies |
github.com/misrab/goutils | |
github.com/mnhkahn/goquery | Package goquery implements features similar to jQuery, including the chainable syntax, to manipulate and query an HTML document. |
github.com/mnhkahn/maodou | A search engine written in golang. |
github.com/mnhkahn/maodou/parser | |
github.com/mnhkahn/maodou/request/proxy | |
github.com/mohae/rancher/app | |
github.com/moqada/gotaku | :fish: Web GYOTAKU Client for Go |
github.com/morephp/gocrawl | gocrawl is a polite, slim and concurrent web crawler written in Go. |
github.com/namsral/microdata | Go package for extracting HTML Microdata |
github.com/neocortical/gsoup | Package gsoup provides HTML sanitization functionality on top of Go's html package |
github.com/n/golibs/net/html | |
github.com/nightowlware/cybergrab | A lean web-crawler library, written in Go. |
github.com/nilslice/goquery | Package goquery implements features similar to jQuery, including the chainable syntax, to manipulate and query an HTML document. |
github.com/nmeum/marvin/modules/url | |
github.com/Nvveen/mir/sprinter | Package sprinter implements our fast web crawler. |
github.com/onuryilmaz/eksisozluk-cli | This is the main package for the "eksisozluk-cli" application |
github.com/openshift/kubernetes/pkg/util/proxy | Package proxy provides transport and upgrade support for proxies |
github.com/opinionated/scraper-core/scraper | cookies.go |
github.com/ostovan/yab/feed/rss | |
github.com/owlfish/tal | Package tal implements the TAL template language for generating HTML5 output. |
github.com/pakohan/go-libs/flatscan | |
github.com/pbberlin/tools/net/http/dedup | package dedup takes multiple dom instances, computing similar subtrees measured by levenshtein distance. |
github.com/pbberlin/tools/net/http/dom | Package dom supplies simple node manipulations. |
github.com/pbberlin/tools/net/http/domclean1 | Package domclean1 normalizes html dom trees in a primitive way. |
github.com/pbberlin/tools/net/http/domclean2 | Package domclean2 normalizes html dom trees; structure and formatting are simplified. |
github.com/pbberlin/tools/net/http/proxy1 | Package proxy1 forwards html pages, simplifying their dom structure; it is a wrapper around domclean2 for actual cleansing and proxification; containing tamper-monkey javascript popup code. |
github.com/pbberlin/tools/net/http/repo | Package repo takes http JSON commands; downloading html files in parallel from the designated source; making them available via quasi-static http fileserver. |
github.com/pbberlin/tools/net/http/tokenize | Package tokenize is a try in splitting a html file into tokens, prior to building a dom. |
github.com/pbberlin/tools/os/osutilpb | package osutilpb creates ZIP files, retrieves first directory, executes commands with exit code, scans directory for filetypes, writes a DOM to file, gets directory of go executable, demonstrates filesystem subtree change watching. |
github.com/pedronasser/caddy-search | search middleware for Caddy |
github.com/peterhellberg/stockholmfoodtrucks | A Stockholm food trucks client written in Go |
github.com/philipjkim/goreadability | Port of arc90's readability project to Go |
github.com/phynalle/kiteroot | Convenient helper to play with HTML tag for Go |
github.com/pierrre/mangadownloader | A manga downloader written in Go |
github.com/pilu/feedfinder | Package feedfinder - autodiscovers feed url on web pages. |
github.com/PuerkitoBio/gocrawl | Package gocrawl is a polite, slim and concurrent web crawler written in Go. |
github.com/PuerkitoBio/goquery | Package goquery implements features similar to jQuery, including the chainable syntax, to manipulate and query an HTML document. |
github.com/rabidgremlin/commonmarkext | Extension to go-commonmark package to allow for plug-able output generators. |
github.com/rainycape/gondola/social/pinterest | |
github.com/ratrocket/goread | google reader clone in go on app engine |
github.com/ratrocket/goread/sanitizer | |
github.com/raymondbutcher/tidyhtml | Package tidyhtml cleans up HTML input and outputs a tidy version. |
github.com/reusee/nw | html node walker |
github.com/rmoore/xmpp-client | |
github.com/rochaporto/ezgliding/soaringweb | Package soaringweb provides functionality to fetch and parse airspace information, taking the international soaringweb db as input. |
github.com/rohanthewiz/go_markdown | Blackfriday markdown processor. |
github.com/rojters/opengraph | Package opengraph extracts Open Graph metadata from HTML documents. |
github.com/rustyoz/knd | |
github.com/sajari/docconv | Converts PDF, DOC, DOCX, XML, HTML, RTF, etc to plain text |
github.com/samcday/go-dash-javadocset | Go implementation of Kapeli/javadocset |
github.com/schachmat/mvgo/backends | |
github.com/scjalliance/bluebeamlicense | Bluebeam License Lookup |
github.com/sdgoij/ghttpfs | |
github.com/sdolard/cornercheck | Package cornercheck collect le bon coin data |
github.com/sdolard/cornercheck/annonce | |
github.com/seletskiy/go-android-rpc/api-doc-parser | |
github.com/sgoertzen/html2text | An html to text converter written in Go. |
github.com/shamsher31/goweblinks | This will give you list of web links from given webpage. |
github.com/shiyanhui/TextUI/layout | |
github.com/shurcooL/frontend/checkbox | |
github.com/shurcooL/frontend/select_menu | |
github.com/shurcooL/github_flavored_markdown | Package github_flavored_markdown provides a GitHub Flavored Markdown renderer with fenced code block highlighting, clickable header anchor links. |
github.com/shurcooL/go/gopherjs_http | Package gopherjs_http provides helpers for compiling Go using GopherJS and serving it over HTTP. |
github.com/shurcooL/go/html_table | Package html_table provides WalkRows to walk the rows of an html table. |
github.com/shurcooL/go/html_to_markdown | Package html_to_markdown converts HTML nodes to Markdown. |
github.com/shurcooL/gtdo | gtdo is the source for gotools.org. |
github.com/shurcooL/gtdo/page | Package page contains code to render pages that can be used from backend and frontend. |
github.com/shurcooL/htmlg | Package htmlg contains helper funcs for generating HTML nodes and rendering them. |
github.com/shurcooL/issuesapp | Package issuesapp is a web frontend for an issues service. |
github.com/shurcooL/play/120 | Play with "testing/quick" package to find an input that causes html.Parse to return an error. |
github.com/shurcooL/play/121 | Play with go/doc.ToHTML and html_to_markdown. |
github.com/shurcooL/play/133 | |
github.com/shurcooL/play/134 | |
github.com/shurcooL/play/148/pages | Package pages contains code to render pages, used from backend and frontend. |
github.com/shurcooL/play/151/c/htmlg | Package htmlg contains helper funcs for generating HTML nodes and rendering them. |
github.com/shurcooL/play/170 | Play with a react-like Render method that generates HTML for a page. |
github.com/shurcooL/play/171 | |
github.com/shurcooL/play/175/example1 | |
github.com/shurcooL/play/176 | Play with an experimental web server that generates HTML pages in a type safe way on the backend only. |
github.com/shurcooL/play/177 | Play with an experimental web server that generates HTML pages in a type safe way on the frontend only. |
github.com/shurcooL/play/178 | Play with user logins and sessions. |
github.com/shurcooL/play/187 | Play with creating an AI player of tic-tac-toe. |
github.com/shurcooL/play/84 | |
github.com/shurcooL/resume | Package resume is Dmitri Shuralyov's résumé. |
github.com/sillydong/gogs/modules/base | |
github.com/sindbis/goquery | Package goquery implements features similar to jQuery, including the chainable syntax, to manipulate and query an HTML document. |
github.com/sisteamnik/GoOse | Html Content / Article Extractor in Golang |
github.com/sisteamnik/go-readability | A Go implementation of the readability algorithm by arc90 labs |
github.com/Skipor/imgserver | Test task for Yandex internship. |
github.com/slyrz/newscat/html | |
github.com/slyrz/newscat/model | |
github.com/smithfox/cascadia | The cascadia package is an implementation of CSS selectors. |
github.com/solidfoxrock/simple-crawler | |
github.com/sourcegraph/htmlclean | Package htmlclean sanitizes HTML based on a tag and attribute whitelist. |
github.com/spiffytech/bible_fixer | |
github.com/sprungknoedl/raup.io | |
github.com/sridif/gocrawl | gocrawl is a polite, slim and concurrent web crawler written in Go. |
github.com/sshaw/glc | The GitHub Link Checker. |
github.com/stayradiated/veehd | |
github.com/stilvoid/please/parsers | Package parsers houses the build-in parser functions that are registered with please |
github.com/studygolang/cascadia | The cascadia package is an implementation of CSS selectors. |
github.com/suapapa/go_subtitle | Package subtitle reads subtitle book from srt and smi file. |
github.com/subosito/shorticon | A simple package for fetching a website's favicon. |
github.com/suzuken/GoOse | Package goose is a golang port of "Goose" originally licensed to Gravity.com under one or more contributor license agreements. |
github.com/sycamoreone/xmpp-client | |
github.com/taylorchu/h5 | |
github.com/tborg/metascraper | Metascraper is a web scraping utility. |
github.com/tbruyelle/emojis | Emojis utils |
github.com/tcnksm/go-latest | go-latest is pacakge to check a provided version is latest from various sources. |
github.com/technosophos/dashing | |
github.com/tengteng/gocrawl | gocrawl is a polite, slim and concurrent web crawler written in Go. |
github.com/thatguystone/swan | Package swan implements the Goose HTML Content / Article Extractor algorithm. |
github.com/thinxer/ggfetch | |
github.com/thinxer/go-htmlutil | A collection of helper methods for go.net/html |
github.com/thomasf/internet | Package internet produces queryable information about the internet by fetching and downloading ripe BGP dumps and cidr-report.org data into redis databases. |
github.com/tshprecher/gopl/sudoku/io | Package io implements logic for downloading puzzles from websudoku.com and writing/reading puzzles to/from files. |
github.com/tsingson/gocrawl | gocrawl is a polite, slim and concurrent web crawler written in Go. |
github.com/unixpickle/gscrape | Scraping Google websites behind a login |
github.com/urandom/readeef/parser/processor | |
github.com/vanng822/go-premailer/premailer | Package premailer is for inline styling. |
github.com/vaz-ar/goxxx/modules/webinfo | Package webinfo retrieves informations from links |
github.com/vdobler/ht/ht | Package ht provides functions for easy testing of HTTP based protocols. |
github.com/vdobler/ht/recorder | Package recorder allows to capture request/response pairs via a reverse proxy and generate tests for these pairs. |
github.com/velour/feedme/webfeed | |
github.com/Ventmere/sitemap/walker | |
github.com/vp89/goffscraper | |
github.com/wcl48/aws-sdk-go/model | Package model contains functionality to generate clients for AWS APIs. |
github.com/weisjohn/htmlrefs | Package htmlrefs returns a slice of `Reference{URI, Token string}`s from an `io.Reader`. |
github.com/wyntr/opengraph | Package opengraph extracts Open Graph metadata from HTML documents. |
github.com/Xenograph/fbparse | A Go library containing tools for parsing Facebook data archives |
github.com/xwz/openid.go | OpenID consumer implementation in Go - golang |
github.com/yahoo/gryffin/html-distance | Package html-distance is a go library for computing the proximity of the HTML pages. |
github.com/yahoo/gryffin/renderer | |
github.com/yhat/middleware | Middleware for the net/http library |
github.com/yhat/scrape | Package scrape provides a searching api on top of golang.org/x/net/html |
github.com/yhat/scrape/example | |
github.com/yogyrahmawan/goquery | Package goquery implements features similar to jQuery, including the chainable syntax, to manipulate and query an HTML document. |
github.com/yohcop/openid-go | OpenID consumer implementation in Go - golang |
github.com/yosssi/gohtml | Package gohtml provides an HTML formatting function. |
github.com/zclark/pimports | Analyzes your polymer files for missing or unused import links |
github.com/zhangheli/goquery | Package goquery implements features similar to jQuery, including the chainable syntax, to manipulate and query an HTML document. |
github.com/zhuharev/GoOse | Html Content / Article Extractor in Golang |
gnd.la/social/pinterest | |
golang.org/x/net/html/charset | Package charset provides common text encodings for HTML documents. |
golang.org/x/tools/cmd/html2article | This program takes an HTML file and outputs a corresponding article file in present format. |
go.marzhillstudios.com/pkg/go-html-transform/css/selector | Package selector contains a css3 selector parser. |
go.marzhillstudios.com/pkg/go-html-transform/h5 | Package h5 implements a wrapper and DSL for golang.org/x/net/html. |
go.marzhillstudios.com/pkg/go-html-transform/html/transform | Package transform implements a html css selector and transformer. |
gopkg.in/andybalholm/cascadia.v0 | The cascadia package is an implementation of CSS selectors. |
gopkg.in/antoineaugusti/crawler.v0/fetchers | |
gopkg.in/AudioAddict/xmlpath.v2 | Package xmlpath implements a strict subset of the XPath specification for the Go language. |
gopkg.in/ddo/pick.v0 | |
gopkg.in/ddo/pick.v1 | |
gopkg.in/felixrabe-go/html.v0 | |
gopkg.in/go-xmlpath/xmlpath.v2 | Package xmlpath implements a strict subset of the XPath specification for the Go language. |
gopkg.in/kubernetes/kubernetes.v0/pkg/util/proxy | Package proxy provides transport and upgrade support for proxies |
gopkg.in/kubernetes/kubernetes.v1/pkg/util/proxy | Package proxy provides transport and upgrade support for proxies |
gopkg.in/linkosmos/tokeq.v0 | |
gopkg.in/linkosmos/tokeq.v0/example | |
gopkg.in/linkosmos/tokeq.v0/nodes | |
gopkg.in/xmlpath.v2 | Package xmlpath implements a strict subset of the XPath specification for the Go language. |
gopl.io/ch5/findlinks1 | Findlinks1 prints the links in an HTML document read from standard input. |
gopl.io/ch5/findlinks2 | Findlinks2 does an HTTP GET on each URL, parses the result as HTML, and prints the links within it. |
gopl.io/ch5/links | Package links provides a link-extraction function. |
gopl.io/ch5/outline | Outline prints the outline of an HTML document tree. |
gopl.io/ch5/outline2 | Outline prints the outline of an HTML document tree. |
gopl.io/ch5/title1 | Title1 prints the title of an HTML document specified by a URL. |
gopl.io/ch5/title2 | Title2 prints the title of an HTML document specified by a URL. |
gopl.io/ch5/title3 | Title3 prints the title of an HTML document specified by a URL. |
h12.me/html-query | Package query provides a fluent and functional interface for querying HTML DOM using Go. |
h12.me/html-query/expr | |
k8s.io/kubernetes/pkg/util/proxy | Package proxy provides transport and upgrade support for proxies |
kythe.io/kythe/go/util/html | Package html is a set of utilities for manipulating html Nodes. |
sourcegraph.com/sourcegraph/htmlclean | Package htmlclean sanitizes HTML based on a tag and attribute whitelist. |
sourcegraph.com/sourcegraph/sourcegraph/sourcecode | Package sourcecode contains formatting and parsing routines for source code. |
sourcegraph.com/sourcegraph/sourcegraph/util/htmlutil | |
src.sourcegraph.com/apps/tracker | Package tracker is a thread tracker app. |
src.vecty.io/vecty | |
willnorris.com/go/gum | Package gum provides the gum personal short URL resolver. |
willnorris.com/go/webmention | Package webmention provides functions for discovering the webmention endpoint for URLs, and sending webmentions according to http://webmention.org/. |
xi2.org/x/genman | Genman is a man page generator for Go command line programs. |
xi2.org/x/htmlnode | Package htmlnode provides functions for searching, traversing and printing parsed HTML. |