Universal Content Importer is a dashboard utility that utilises Universal Content Puller to pull content and files in bulk from any site. Corresponding pages are created within the ConcreteCMS site, with the pulled content added to blocks on those pages.
Where blocks cannot be fully added, UCI provides a comprehensive notification and review system to facilitate manual addition of that last little bit.
Once installed, you will have:
Universal Content Importer uses DOM selectors to home in on the section(s) of the source site to import. Thus it requires a source site that has some regularity in the way its DOM is organised. This is a reasonably dependable assumption for sites that were generated by a CMS or other site generator, but may become less dependable for sites of hand-written HTML.
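As a rough illustration of what homing in on a region of the DOM involves, the Python sketch below collects the text inside the element with a given id, using only the standard library. It is not UCI's code, and a bare id lookup is far simpler than a full DOM selector; the class name RegionExtractor, the sample markup and the main-content id are all assumptions made for the example.

```python
from html.parser import HTMLParser

# Elements that never have a closing tag, so they must not affect nesting depth.
VOID_TAGS = {"area", "base", "br", "col", "embed", "hr", "img", "input",
             "link", "meta", "source", "track", "wbr"}


class RegionExtractor(HTMLParser):
    """Collect the text inside the first element whose id matches target_id."""

    def __init__(self, target_id):
        super().__init__()
        self.target_id = target_id
        self.depth = 0      # nesting depth inside the target; 0 means outside
        self.done = False   # set once the target element has been closed
        self.chunks = []

    def handle_starttag(self, tag, attrs):
        if tag in VOID_TAGS or self.done:
            return
        if self.depth:
            self.depth += 1
        elif dict(attrs).get("id") == self.target_id:
            self.depth = 1

    def handle_endtag(self, tag):
        if tag in VOID_TAGS or not self.depth:
            return
        self.depth -= 1
        if self.depth == 0:
            self.done = True

    def handle_data(self, data):
        # Keep only non-blank text that sits inside the target element.
        if self.depth and data.strip():
            self.chunks.append(data.strip())


def extract_region(html, target_id):
    parser = RegionExtractor(target_id)
    parser.feed(html)
    parser.close()
    return parser.chunks


# Hypothetical source page: only the main-content region is wanted.
SAMPLE = """
<html><body>
  <div id="nav"><a href="/">Home</a></div>
  <div id="main-content">
    <h1>About us</h1>
    <p>We make <b>widgets</b>.<br>Since 1999.</p>
  </div>
  <div id="footer">Copyright</div>
</body></html>
"""

region = extract_region(SAMPLE, "main-content")
```

The sketch relies on every page carrying the same id around its main content, which is the kind of regularity a CMS-generated site tends to provide and a hand-written one may not.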
Where blocks cannot be automatically generated, UCI places a Notification block summarising what review and completion actions are required at the top of the imported area. UCI then inserts Placeholder blocks into the page where blocks need to be added manually.
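The layout described above can be pictured with a small Python sketch: blocks that converted cleanly are kept, each block that did not is replaced by a placeholder, and a single notification summarising the outstanding work is put first. The dictionary shapes and the names plan_area, notification and placeholder are hypothetical, chosen only for illustration; they do not reflect UCI's internal data structures.

```python
def plan_area(extracted):
    """Return the ordered block list for one imported area.

    `extracted` is a list of dicts with keys 'type', 'ok' and, for blocks
    that could not be converted, 'reason'. (Hypothetical shape.)
    """
    blocks = []
    todo = []
    for item in extracted:
        if item["ok"]:
            blocks.append({"type": item["type"]})
        else:
            # Mark the spot where a block must be added by hand.
            blocks.append({"type": "placeholder", "wanted": item["type"]})
            todo.append(f"{item['type']}: {item['reason']}")
    if todo:
        # One summary at the top of the area lists everything left to review.
        blocks.insert(0, {"type": "notification", "actions": todo})
    return blocks


area = plan_area([
    {"type": "content", "ok": True},
    {"type": "form", "ok": False, "reason": "form fields need rebuilding"},
    {"type": "image", "ok": True},
])
```

An area in which every block converts gets no notification at all, so a reviewer only needs to visit pages that carry one.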
UCI will not import a source site fully automatically, but if it can do 90% of the work for you and point you to where additional work is required, reviewing and fixing the last 10% can still save many days or even weeks of editing pages when converting a large site to ConcreteCMS.
During the review process, the UCI blocks can be manually removed from individual pages as they are completed. Alternatively, they can be left in place and hidden from visitors using Advanced Permissions, or the UCI blocks can be left in place and removed in bulk by uninstalling the blocks or UCI in the dashboard.
Once the import process is completed and fully reviewed, Universal Content Importer can be safely uninstalled and is in no way required for continued operation of the site.
Just because you can import content in bulk from another site does not mean you can legally do so. Please respect the copyright of others. Only import content where:
Universal Content Importer is built on top of Universal Content Puller and designed for extension using similar mechanisms. Content sources, transforms and displays follow a pluggable and extensible architecture for easy integration of further sources, transforms and displays from within UCP or provided by third party packages or your own application specific plugins.
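The source, transform and display split can be sketched as three registries of interchangeable callables. This is an illustrative Python sketch only: UCP plugins are written for ConcreteCMS, and the registry names, the register decorator and the plugin handles below are invented for the example.

```python
# Hypothetical registries: one per plugin kind, keyed by plugin handle.
SOURCES, TRANSFORMS, DISPLAYS = {}, {}, {}


def register(registry, handle):
    """Decorator that files a callable in a registry under a handle."""
    def decorator(func):
        registry[handle] = func
        return func
    return decorator


@register(SOURCES, "literal")
def literal_source(config):
    # A stand-in source: returns text supplied directly in the config.
    return config["text"]


@register(TRANSFORMS, "strip")
def strip_transform(text, config):
    # A stand-in transform: trims surrounding whitespace.
    return text.strip()


@register(DISPLAYS, "plain")
def plain_display(text, config):
    # Output the transformed text unchanged.
    return text


def pull(source, transform, display, config):
    """Run one source, one transform and one display, looked up by handle."""
    text = SOURCES[source](config)
    text = TRANSFORMS[transform](text, config)
    return DISPLAYS[display](text, config)


result = pull("literal", "strip", "plain", {"text": "  hello  "})
```

Under this arrangement a further source, transform or display is added by registering another callable under a new handle, without changing pull itself.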
Universal Content Importer is only available directly from JohntheFish. Please contact me to discuss.
A general purpose content importer for batch importing pages and images from any site. Internally uses Universal Content Puller, so installing Universal Content Puller is a prerequisite.
/dashboard/blocks/universal_content_puller/universal_content_importer
A general purpose content importer for batch importing pages and images from any site. Internally uses Universal Content Puller, so installing Universal Content Puller is a prerequisite.
/dashboard/pages/universal_content_importer
A general purpose content importer for batch importing pages and images from any site. Internally uses Universal Content Puller, so installing Universal Content Puller is a prerequisite.
/dashboard/pages/universal_content_importer/settings
Settings for each stage of the importer process.
/dashboard/pages/universal_content_importer/settings/import_remote_sitemap
Import a remote sitemap into the UCI workspace to pull content from.
/dashboard/pages/universal_content_importer/settings/grab_remote_pages
Add HTML content to the UCI workspace.
/dashboard/pages/universal_content_importer/settings/metadata
Optionally add further page data not provided by Import Remote Sitemap.
/dashboard/pages/universal_content_importer/settings/create_pages
Create or update local concrete5 pages based on remote URLs. Blocks are added later.
/dashboard/pages/universal_content_importer/settings/list_images
Extract a list of images required for URLs in the UCI Workspace.
/dashboard/pages/universal_content_importer/settings/import_images
Import images to the concrete5 File Manager based on image URLs listed in the UCI Workspace.
/dashboard/pages/universal_content_importer/settings/list_documents
Extract a list of documents linked by URLs in the UCI Workspace.
/dashboard/pages/universal_content_importer/settings/import_documents
Import documents to the concrete5 File Manager based on document URLs listed in the UCI Workspace.
/dashboard/pages/universal_content_importer/settings/extract_blocks_from_pages
HTML grabbed from remote pages is parsed to extract Content and other blocks.
/dashboard/pages/universal_content_importer/settings/add_blocks_to_pages
Extracted blocks are added to local pages.
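Read in order, the settings pages above suggest a staged workflow. The Python sketch below simply records the stage handles taken from those dashboard paths and builds each path from a common base. Treating the listing order as the run order is an assumption, and STAGES and settings_path are names made up for the example.

```python
BASE = "/dashboard/pages/universal_content_importer/settings"

# Stage handles copied from the dashboard paths listed above.
STAGES = [
    "import_remote_sitemap",
    "grab_remote_pages",
    "metadata",
    "create_pages",
    "list_images",
    "import_images",
    "list_documents",
    "import_documents",
    "extract_blocks_from_pages",
    "add_blocks_to_pages",
]


def settings_path(stage):
    """Return the dashboard settings path for a known stage handle."""
    if stage not in STAGES:
        raise ValueError(f"unknown stage: {stage}")
    return f"{BASE}/{stage}"


# Print the stages as a numbered checklist.
for number, stage in enumerate(STAGES, start=1):
    print(number, settings_path(stage))
```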
Pull content from many sources and display it in many ways.
ConcreteCMS Marketplace v9.3.24
/dashboard/blocks/universal_content_puller
Pull content from many sources and display it in many ways.
/dashboard/blocks/universal_content_puller/plugins
Plugins for the Universal Content Puller block
/dashboard/blocks/universal_content_puller/global_settings
Edit global settings and defaults for the Universal Content Puller block.
Sources extension for Universal Content Puller. The sources in this extension are XX because they are the kind of source you may not want to let just anyone loose with, hence a separate package so they don't have to be installed with less sensitive sources.
Most of this sidebar is built using Universal Content Puller.
The Content Source is Page Area, set to pull the Sidebar area of the Universal Content Puller page and, within that, sliced to just the Page List.
The Content Transform is Selector, set to remove container and row classes that, when unnecessarily nested, could mess up the Bootstrap grid. The Content Display is Plain, which just outputs the transformed text.
In the advanced settings, sanitization is disabled as we trust the source page and don't want to strip out any formatting or functionality from the pulled sidebar.