File:MACHINE LEARNING OF EXTREMELY LARGE SETS OF SIGNAL COLLECTIONS USING CLUSTER COMPUTING (IA machinelearningo1094564153).pdf

Size of this JPG preview of this PDF file: 463 × 599 pixels. Other resolutions: 185 × 240 pixels | 371 × 480 pixels | 593 × 768 pixels | 1,275 × 1,650 pixels.

Original file ‎(1,275 × 1,650 pixels, file size: 3.23 MB, MIME type: application/pdf, 90 pages)

Captions

English

Add a one-line explanation of what this file represents

Summary[edit]

MACHINE LEARNING OF EXTREMELY LARGE SETS OF SIGNAL COLLECTIONS USING CLUSTER COMPUTING ( )
Author	Ferris, Christopher L.
Title	MACHINE LEARNING OF EXTREMELY LARGE SETS OF SIGNAL COLLECTIONS USING CLUSTER COMPUTING
Publisher	Monterey, CA; Naval Postgraduate School
Description	Multitudes of signals are transmitted over the airwaves at any given moment, creating a large intelligence opportunity and reconnaissance problem. As technology advances, cluster computing methods must be explored to fill the intelligence gap caused by an increasingly large amount of data and a limited number of human analysts. In this thesis, Apache HBase, Phoenix, and Spark are employed on an AWS EMR cluster to store, query, and implement the K-means machine learning algorithm on a large-scale signals database. The signal databases tested consist of up to 100 million randomly generated signals, with nine feature columns of metadata. The signal data set is first bulk-loaded into HBase and a Phoenix layer is implemented. The data is then queried from Spark into a Dataframe for machine learning implementation. Additionally, the K-means implementations are run on multiple different computer-cluster configurations to test performance as a function of the number of computers in the cluster. This thesis demonstrates the capabilities and benefits of utilizing open-source software and cluster computing to implement large-scale machine learning on signal metadata. Subjects: machine learning; cluster computing; signal collection; signal analysis
Language	English
Publication date	December 2019
Current location	IA Collections: navalpostgraduateschoollibrary; fedlink
Accession number	machinelearningo1094564153
Source	Internet Archive identifier: machinelearningo1094564153 https://archive.org/download/machinelearningo1094564153/machinelearningo1094564153.pdf
Permission (Reusing this file)	This publication is a work of the U.S. Government as defined in Title 17, United States Code, Section 101. Copyright protection is not available for this work in the United States.

Licensing[edit]

	This work is in the public domain in the United States because it is a work prepared by an officer or employee of the United States Government as part of that person’s official duties under the terms of Title 17, Chapter 1, Section 105 of the US Code. Note: This only applies to original works of the Federal Government and not to the work of any individual U.S. state, territory, commonwealth, county, municipality, or any other subdivision. This template also does not apply to postage stamp designs published by the United States Postal Service since 1978. (See § 313.6(C)(1) of Compendium of U.S. Copyright Office Practices). It also does not apply to certain US coins; see The US Mint Terms of Use.
This file has been identified as being free of known restrictions under copyright law, including all related and neighboring rights.

PDMCreative Commons Public Domain Mark 1.0falsefalse

File history

Click on a date/time to view the file as it appeared at that time.

	Date/Time	Thumbnail	Dimensions	User	Comment
current	16:55, 22 July 2020		1,275 × 1,650, 90 pages (3.23 MB)	Fæ (talk \| contribs)	FEDLINK - United States Federal Collection machinelearningo1094564153 (User talk:Fæ/IA books#Fork8) (batch 1993-2020 #20989)

You cannot overwrite this file.

File usage on Commons

The following page uses this file:

File:MACHINE LEARNING OF EXTREMELY LARGE SETS OF SIGNAL COLLECTIONS USING CLUSTER COMPUTING (IA machinelearningo1094564153).pdf

Metadata

This file contains additional information such as Exif metadata which may have been added by the digital camera, scanner, or software program used to create or digitize it. If the file has been modified from its original state, some details such as the timestamp may not fully reflect those of the original file. The timestamp is only as accurate as the clock in the camera, and it may be completely wrong.

Short title	MACHINE LEARNING OF EXTREMELY LARGE SETS OF SIGNAL COLLECTIONS USING CLUSTER COMPUTING
Image title
Author	Ferris, Christopher L.
Software used	Ferris, Christopher L.
Conversion program	Adobe PDF Library 11.0
Encrypted	no
Page size	612 x 792 pts (letter)
Version of PDF format	1.4

File:MACHINE LEARNING OF EXTREMELY LARGE SETS OF SIGNAL COLLECTIONS USING CLUSTER COMPUTING (IA machinelearningo1094564153).pdf

Captions

Captions

Summary[edit]

Licensing[edit]

File history

File usage on Commons

Metadata

Structured data

Items portrayed in this file

depicts

Navigation menu

File:MACHINE LEARNING OF EXTREMELY LARGE SETS OF SIGNAL COLLECTIONS USING CLUSTER COMPUTING (IA machinelearningo1094564153).pdf

Captions

Captions

Summary[edit]

Licensing[edit]

File history

File usage on Commons

Metadata

Navigation menu

Search